ON THE EIGENVALUE PROBLEM Tu - λSu = 0 WITH UNBOUNDED AND
NONSYMMETRIC OPERATORS T AND S

ABSTRACT 

In this paper we investigate a general eigenvalue problem (i) Tu - λSu = 0,
where T and S are linear (in general unbounded and non-selfadjoint) operators
in a complex or real Hilbert space, from the point of view of the existence of
eigenvalues and eigenvectors of (i) and their approximate calculation. After
studying in Chapter 1 the properties of a certain class of K-positive definite
and K-real operators, we use these properties in Chapter 2 to investigate the
existence of eigenvalues and eigenvectors of (i) and their properties under
various conditions on T and S. In Chapter 3 we prove the applicability and the
unifying property of the generalized method of moments for the approximate
solution of (i), while in Chapter 4 we present and investigate a fairly general
iterative method for the approximate solution of (i) which does not require the
preliminary reduction of (i) to an equivalent problem with bounded operators.
This study allows us to formulate eigenvalue problems, and to solve them
approximately, also in the case when T and S are nonsymmetric differential
operators of odd or even order.
ON THE EIGENVALUE PROBLEM Tu - λSu = 0 WITH UNBOUNDED AND
NONSYMMETRIC OPERATORS T AND S*

W. V. Petryshyn†

Introduction

During the last quarter century the problem of proving the existence and
deriving the fundamental properties of eigenvalues and eigenvectors of the
problem (i) Tu - λSu = 0, where T and S are linear operators in a Hilbert
space, as well as the problem of solving (i) approximately, have been
investigated by many authors.(#) However, the investigation of both of these
problems was limited mostly to selfadjoint bounded and unbounded differential
and abstract operators.

Thus, when T and S are unbounded and selfadjoint operators, the first of these
problems was considered in [4, 5, 8, 10, 14, 25] in case the operators are
positive definite and in [12, 18, 36] in case S is the identity and the
spectrum of T contains at most eigenvalues of finite multiplicity. The
investigation of the second problem, both for bounded and unbounded operators,
led to the development of a group of direct methods of Ritz, Galerkin,
moments, and others [4, 5, 10, 14, 15, 16, 25, 27] and a group of iterative
methods of gradient type [1, 2, 3, 6, 11, 13, 16, 17, 19, 20, 22, 29, 30, 33,
37]. Unfortunately, as was pointed out in [16], the study of the gradient
methods was almost entirely (cf. [33]) restricted to finite selfadjoint and
positive definite matrices and bounded operators.

* The work presented in this paper is partly supported by the AEC Computing
and Applied Mathematics Center, Courant Institute of Mathematical Sciences,
New York University, under Contract AT(30-1)-1480 with the U. S. Atomic Energy
Commission, and the National Science Foundation.

† Temporary member at the Courant Institute of Mathematical Sciences.

(#) It should be remarked that the list of papers cited in this report
contains only those papers to which a direct reference is made and therefore
makes no claim to completeness regarding the numerous contributions to the
eigenvalue problem (i) with bounded and unbounded operators. For the
discussion of various procedures mentioned here and for a more complete list
of references see [4, 6, 10, 16, 37].

The purpose of this paper is first to investigate the properties of a certain
class of unbounded and nonsymmetric operators, the so-called K-positive
definite and K-real operators, and then to extend to this larger class of
problems of type (i) the results of the above authors concerning both the
existence of eigenvalues and eigenvectors and the approximate solution of (i),
by extending the applicability of the procedures mentioned above. This
approach allows us not only to present these seemingly different methods in a
more unified manner but also to introduce a fairly general iterative method,
in some sense a gradient method, for the direct solution of the eigenvalue
problem (i) with unbounded and nonsymmetric operators. Thus, our
generalization will allow us to apply the theory and the various methods to
non-selfadjoint differential and integrodifferential eigenvalue problems of
even and odd order.

The author is greatly indebted to Professors E. Isaacson and P. D. Lax for
their interest and for suggestions offered in the course of the preparation of
this paper. Thanks are also due to Professor C. De Prima for numerous
stimulating discussions concerning some topics of this paper.

We begin with a brief summary of the results presented below.

In Chapter 1 we define a more general class of K-p.d. operators than that
considered in [23, 26] and derive some of their properties. Thus, in Section
1.1 we extend to this class the results obtained in [26] concerning the
generalized variational principle, the K-p.d. solvable generalized Friedrichs
extensions, and others. In Section 1.2 we construct a Friedrichs extension for
a non-K-p.d. operator under weaker conditions than those in [26] and, by
specializing the operators, derive from it directly the corresponding results
of Lax and Milgram [21]. In Section 1.3 it is shown that the results obtained
in Sections 1.1 and 1.2 together with the recent results of Polsky [28]
guarantee the applicability of the comparison principle and the projectional
methods for the solution of a complicated operator equation. In Section 1.4 we
introduce the concept of an $H_0$-bounded operator and prove some properties
concerning the boundedness and compactness (complete continuity) of such
operators, while in Section 1.5 we use these properties to derive the error
estimate for the generalized Ritz method. As a special case we obtain the
recent result of Stadnikova [35].

Chapter 2 deals with the question of existence of eigenvalues and eigenvectors
for the problem (i) and their properties. Thus, in Sections 2.1 and 2.2 we
extend to the eigenvalue problem (i) the corresponding results obtained in
[8, 25], while in Section 2.3 we consider conditions under which the results
derived in [14, 36, 18, 12] are also valid for the eigenvalue problems of type
(i) with K-p.d. operators. Let us add that, using the properties of
$H_0$-bounded operators derived in Chapter 1, we prove here also a stronger
assertion concerning the Fourier expansion and the completeness of
eigenvectors than those obtained in [18, 12].

In Chapter 3 we indicate briefly the applicability and the unifying property
of the generalized method of moments for the approximate solution of the
eigenvalue problem (i); the chapter contains a slight generalization of the
corresponding results of Mikhlin [25] and Polsky [27]. In fact, in Section 3.1
we prove two lemmas which generalize the results of Polsky [27], while in
Section 3.2, following Mikhlin, we formulate the generalized method of moments
for the approximate calculation of eigenvalues and eigenvectors of (i) and
then prove its convergence. In Section 3.3 we show that by specializing the
operators K and S we obtain as special cases the ordinary Ritz method, the
generalized Ritz method, the Galerkin method, and the method of moments.
In Chapter 4 we present and investigate a fairly general iterative method for
the solution of (i) which unifies and extends to the eigenvalue problem (i)
with unbounded and nonsymmetric operators the results of a number of authors
[1, 3, 6, 11, 16, 17, 20, 29, 33], obtained by them mostly for positive
definite symmetric matrices and bounded operators. Thus, in Sections 4.1 and
4.2 we formulate the method and consider some of its properties, while in
Section 4.3 we derive the basic theorems for the convergence of the
approximate eigenvalues and eigenvectors. In Section 4.4 we discuss conditions
under which we obtain convergence to the smallest eigenvalue and to an element
in the corresponding eigenspace. Here we also obtain an error estimate for the
approximate eigenvectors. Section 4.5 deals with the problem of constructing
various methods (i.e., depending on the choice of the sequence $\{t_n\}$) and
their special cases. Here we discuss the method of constant factor [6], a
relatively new method called here the method with relative minimal norms
[20, 29, 30] and its special cases [2], the accelerated method with relative
minimal norms and its special cases [19, 1], the generalized method of
steepest descent [15, 3, 33], the method introduced by Hestenes and Karush
[11], which we call the modified method of steepest descent, and others.




1. On a Class of K-p.d. and Non-K-p.d. Operators

In this chapter we define a class of linear, in general unbounded, operators
in a Hilbert space and derive some of their properties to be used in the
sequel. In particular, we derive those properties which should be useful in
the application of the theory to differential and integro-differential
boundary-value and eigenvalue problems of odd and even order. The
applicability of approximate methods is also indicated.

1.1. K-positive definite operators. Let H be a complex and separable Hilbert
space. An operator T defined on a dense domain $D_T$ in H will be called
K-positive definite (K-p.d.)(1) if there exists a closeable(2) operator K with
$D_K \supseteq D_T$ mapping $D_T$ onto a dense subset $KD_T$ of H and two
constants $\alpha_1 > 0$ and $\alpha_2 > 0$ such that

$$(Tu, Ku) \ge \alpha_1 \|u\|^2, \quad u \in D_T, \tag{1.1}$$

and

$$\|Ku\|^2 \le \alpha_2 (Tu, Ku), \quad u \in D_T. \tag{1.2}$$

(1) The class of K-p.d. operators, with K closed, was introduced by Martyniuk
[23], while its properties were discussed by the author [26] for the class in
which K has a bounded inverse.

(2) Let us recall that K is said to be closeable if, whenever $\{u_n\}$ is a
sequence in $D_K$ and f an element in H such that $u_n \to 0$ and
$Ku_n \to f$ as $n \to \infty$, then f = 0. Let us add that, as was observed
by Rellich [31], in applications it is more convenient to work with closeable
than with closed operators.

Let us first note that the class of K-p.d. operators, as defined above,
contains among others the following operators: (a) Positive definite operators
T; in this case we choose the operator K to be either the identity I or, if T
is also selfadjoint, to be any root of T. (b) Closeable and densely
invertible(3) operators
2i+l 
T    when  we  take    K    to  be    T.      (c)  The  operators    T    of  the  form    T  =  S  '^         or 

T  =  S  when  for  some    i,     0  5  i  £  J'     the  operator    S  is  positive  definite; 

in this case we take $K = S^{2i+1}$ or $K = S^{2i+2}$. This class, in
particular, contains ordinary differential operators of odd and even order and
weakly elliptic partial differential operators of odd or even order [24]
which are, in general, non-selfadjoint. (d) A subclass of bounded
symmetrizable operators investigated by many authors [38].

Furthermore, we shall see below that there are also K-p.d. operators which do
not belong to any of the above classes.
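
Case (b) can be checked numerically in finite dimensions. The sketch below is an illustration only, not part of the source: it assumes a real two-dimensional space in place of the complex space H, an arbitrarily chosen nonsymmetric invertible matrix `T`, and the choice K = T. The helper names `matvec`, `dot`, and `kpd_constants` are hypothetical. It samples vectors u and estimates the constants of (1.1) and (1.2).

```python
import random

# Assumed example data: a nonsymmetric, invertible 2x2 matrix and K = T (case (b)).
T = [[2.0, 1.0], [0.0, 1.0]]
K = T


def matvec(a, x):
    """Matrix-vector product for small dense matrices stored as nested lists."""
    return [sum(a[i][j] * x[j] for j in range(len(x))) for i in range(len(a))]


def dot(x, y):
    """Real Euclidean inner product."""
    return sum(xi * yi for xi, yi in zip(x, y))


def kpd_constants(T, K, trials=2000, seed=0):
    """Estimate alpha1 = inf (Tu,Ku)/||u||^2 and alpha2 = sup ||Ku||^2/(Tu,Ku)
    over randomly sampled nonzero vectors u."""
    rng = random.Random(seed)
    alpha1 = float("inf")
    alpha2 = 0.0
    for _ in range(trials):
        u = [rng.uniform(-1.0, 1.0) for _ in range(len(T))]
        norm_sq = dot(u, u)
        if norm_sq < 1e-12:
            continue  # skip (near-)zero samples
        Tu, Ku = matvec(T, u), matvec(K, u)
        q = dot(Tu, Ku)  # the form (Tu, Ku)
        alpha1 = min(alpha1, q / norm_sq)
        alpha2 = max(alpha2, dot(Ku, Ku) / q)
    return alpha1, alpha2


alpha1, alpha2 = kpd_constants(T, K)
print(alpha1, alpha2)
```

With K = T one has (Tu, Ku) = ||Tu||^2, so (1.2) holds with constant 1, while the sampled lower bound in (1.1) cannot fall below the smallest eigenvalue of the matrix TᵀT, which for this particular `T` is 3 - √5.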

Lemma 1.1. If T is K-p.d., then

(a) T is invertible;

(b) T is K-symmetric; i.e., $(Tu, Kv) = (Ku, Tv)$, $u, v \in D_T$;

(c) $|(Tu, Kv)|^2 \le (Tu, Ku)(Tv, Kv)$, $u, v \in D_T$;

(d) T is closeable.

(3) We will call an operator T invertible if T has a bounded inverse, densely
invertible if T is invertible and the range $R_T$ is dense in H, and
continuously invertible if T is densely invertible and $R_T = H$.

Proof. To prove (a) note that since $(Tu, Ku) \le \|Tu\|\,\|Ku\|$ the
inequality (1.2) implies that $\|Ku\| \le \alpha_2 \|Tu\|$. Hence, in view of
(1.1), $\|Tu\| \ge \alpha_1^{1/2} \alpha_2^{-1/2} \|u\|$. Thus, T is
invertible. The proof of (b) and (c) can be carried over word for word from
[26]. In order to show (d) note that if $\{u_n\}$ is a sequence in $D_T$ and f
an element in H such that $u_n \to 0$ and $Tu_n \to f$ as $n \to \infty$, then
since $\|Ku\| \le \alpha_2 \|Tu\|$ for all u in $D_T$ and K is closeable, we
conclude that $Ku_n \to 0$ as $n \to \infty$. Let v be any element in $D_T$.
Since $Ku_n \to 0$ and $Tu_n \to f$, Lemma 1.1(b) implies that

$$(f, Kv) = \lim_n (Tu_n, Kv) = \lim_n (Ku_n, Tv) = 0, \quad v \in D_T.$$

The last equality and the denseness of the set $KD_T$ in H show that f = 0;
i.e., T is closeable.
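
The constant in part (a) can be traced explicitly. The following short derivation is a sketch that uses only (1.1), (1.2), and the Cauchy-Schwarz inequality; the arrangement of the steps is ours and is not taken from the source.

```latex
\begin{align*}
\|Ku\|^2 &\le \alpha_2 (Tu,Ku) \le \alpha_2 \|Tu\|\,\|Ku\|
  &&\Longrightarrow\quad \|Ku\| \le \alpha_2 \|Tu\|, \\
\alpha_1 \|u\|^2 &\le (Tu,Ku) \le \|Tu\|\,\|Ku\| \le \alpha_2 \|Tu\|^2
  &&\Longrightarrow\quad \|Tu\| \ge (\alpha_1/\alpha_2)^{1/2}\,\|u\| .
\end{align*}
```

The second line shows that Tu = 0 forces u = 0 and that the inverse of T on its range is bounded by $(\alpha_2/\alpha_1)^{1/2}$.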

Note that the denseness of $KD_T$ was only needed in the proof of (d).

Let f be an element in H and F(u) the functional

$$F(u) = (Tu, Ku) - (Ku, f) - (f, Ku) \tag{1.3}$$

defined on $D_T$. It was shown in [26] that the problem of solving the
equation

$$Tu = f \tag{1.3a}$$

is equivalent to the problem of minimizing the functional F(u) and that to
solve the latter it is, in general, necessary to extend somewhat the set
$D_T$ on which F(u) is defined and with it also the operator T. It will be
shown below, using essentially the same arguments as in [7, 25, 26], that the
variational problem is solvable and that T possesses a closed and
continuously invertible K-p.d. extension.

Let D[T] denote the set $D_T$ with the new metric

$$[u, v] = (Tu, Kv), \quad |u|^2 = [u, u], \quad u, v \in D_T. \tag{1.4}$$

Clearly D[T] satisfies all the hypotheses of a Hilbert space except possibly
completeness. Furthermore, in view of (1.1) and (1.2), we have the
inequalities

$$|u| \ge \gamma_1 \|u\|, \quad \gamma_1 = \sqrt{\alpha_1} > 0, \quad u \in D_T, \tag{1.5}$$

and

$$\|Ku\| \le \gamma_2 |u|, \quad \gamma_2 = \sqrt{\alpha_2} > 0, \quad u \in D_T. \tag{1.6}$$

Let $H_0$ denote the completion of D[T] in the metric (1.4).

Lemma 1.2. (a) D[T] is dense in $H_0$.

(b) $H_0$ is a subset of H in the sense of uniquely identifying the elements
from $H_0$ with certain elements from H.

(c) K can be extended to a bounded operator of all of $H_0$ to H.

(d) The inequalities (1.5) and (1.6) are valid for all u in $H_0$.

We omit the proof of Lemma 1.2, for it follows the same arguments as the
proof of the corresponding lemma in [26]. We need only replace, in the proof
of (b), the operator K by its trivial closed extension(4) $\bar{K}$, which
exists since K is closeable. Furthermore, the denseness of the set $KD_T$ is
not used in the proof of Lemma 1.2.

Having constructed the auxiliary space $H_0$ it is now easy to solve the
variational problem. In fact, by Lemma 1.2(c), (f, Ku) is a bounded conjugate
linear functional of u in $H_0$ and hence the Fréchet-Riesz theorem implies
that to every fixed element f in H there exists a unique element
$w \in H_0$ such that for all u in $H_0$

$$(f, Ku) = [w, u]. \tag{1.7}$$

Consequently, the functional

$$F(u) = [u, u] - [u, w] - [w, u] = |u - w|^2 - |w|^2, \tag{1.8}$$

which by definition (1.3) is valid for all u in $D_T$, can also be extended
to the entire space $H_0$. Considered in $H_0$, F(u) attains its minimum d
at u = w with

$$d = \inf_{u \in H_0} F(u) = F(w) = -|w|^2. \tag{1.9}$$

We shall formulate the above result in the following

Lemma 1.3. If T is K-p.d., then $d = \inf_{u \in H_0} F(u)$ is attained at
u = w, where w is uniquely determined by (1.7). Furthermore, the value of d
is given by (1.9).
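
The variational statement can be illustrated in finite dimensions. The sketch below is not from the source: it assumes a real two-dimensional space, an arbitrarily chosen nonsymmetric invertible matrix `T`, the choice K = T, and a right-hand side `f`; the helper names `matvec`, `dot`, `bracket`, `F`, and `solve2` are hypothetical. In a real space the functional (1.3) reads F(u) = (Tu, Ku) - 2 (f, Ku), and the minimizer should coincide with the solution of Tu = f.

```python
# Assumed example data: T nonsymmetric and invertible, K = T, f arbitrary.
T = [[2.0, 1.0], [0.0, 1.0]]
K = T
f = [1.0, 3.0]


def matvec(a, x):
    """Matrix-vector product for small dense matrices stored as nested lists."""
    return [sum(a[i][j] * x[j] for j in range(len(x))) for i in range(len(a))]


def dot(x, y):
    """Real Euclidean inner product."""
    return sum(xi * yi for xi, yi in zip(x, y))


def bracket(u, v):
    """The inner product [u, v] = (Tu, Kv) of (1.4)."""
    return dot(matvec(T, u), matvec(K, v))


def F(u):
    """The functional (1.3) in a real space: (Tu, Ku) - 2 (f, Ku)."""
    return bracket(u, u) - 2.0 * dot(f, matvec(K, u))


def solve2(a, b):
    """Solve the 2x2 linear system a x = b by Cramer's rule."""
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    return [(b[0] * a[1][1] - a[0][1] * b[1]) / det,
            (a[0][0] * b[1] - b[0] * a[1][0]) / det]


w = solve2(T, f)                 # solution of Tu = f
d = F(w)                         # value of the functional at w
print(w, d, -bracket(w, w))      # d should agree with -|w|^2 as in (1.9)
```

Perturbing `w` in any direction increases `F`, since F(u) = |u - w|^2 - |w|^2 by (1.8).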


(4) The operator $\bar{K}$ is called a closure or a trivial closed extension
of K in H if it is defined on the set $D_{\bar{K}}$ ($\supseteq D_K$)
consisting of all elements u in H for which there exists a sequence
$\{u_n\}$ in $D_K$ and an element f in H such that $\|u_n - u\| \to 0$ and
$\|Ku_n - f\| \to 0$ as $n \to \infty$; in this case we set
$\lim_n Ku_n = \bar{K}u = f$.
Let us note that, in general, the element w which minimizes the functional
F(u) is not necessarily in $D_T$, so that the equation (1.3a) may not have a
solution unless T is somewhat extended. Theorem 1.1 below shows that for
K-p.d. T such an extension is always possible.

Theorem 1.1. If T is K-p.d., then T can be extended to a closed K-p.d.
operator $T_0$ such that $T_0 \supseteq T$, $T_0$ is continuously invertible,
and $D_{T_0}$ consists of all elements realizing the infimum of F(u) in
$H_0$ as f ranges through the entire space H.

Proof. We know that (f, Kv) is a bounded conjugate linear functional of v in
$H_0$ for each f in H. Hence there exists a linear bounded operator G defined
on all of H into $H_0$ such that

$$(f, Kv) = [Gf, v], \quad v \in H_0, \quad f \in H, \tag{1.10}$$

and, in view of (1.6), $\|G\| \le \gamma_2$. Moreover, if Gf = 0, then by
(1.10) we have (f, Kv) = 0 for all v in $H_0$. Since $KD_T$ is dense in H,
this implies that f = 0. Thus $T_0 = G^{-1}$ exists and is a closed operator.
Furthermore, $T_0$ is K-p.d. and $T_0 \supseteq T$. In fact, setting w = Gf we
see that $w \in D_{T_0} \subseteq H_0$, $T_0 w = f$, and the relations (1.10),
(1.5), and (1.6) imply that

$$(T_0 w, Kw) = [GT_0 w, w] = [w, w] \ge \alpha_1 \|w\|^2$$

and thus, also, that

$$\|Kw\|^2 \le \alpha_2 [w, w] = \alpha_2 (T_0 w, Kw).$$

This shows that $T_0$ is K-p.d. To prove that $T_0 \supseteq T$ observe that
the definition of the $H_0$-metric on the one hand and the relation (1.10) on
the other yield the equality

$$[u, v] = (Tu, Kv) = [GTu, v]$$

for all pairs u, v in D[T]. Since D[T] is dense in $H_0$, the last equality
implies that u = GTu for all u in $D_T$. This shows that $u \in D_{T_0}$ and
$T_0 u = Tu$; i.e., $T_0 \supseteq T$ and $R_{T_0} = H$. The last assertion of
Theorem 1.1 follows from the definition of the operator $T_0$ ($= G^{-1}$)
and Lemma 1.3.

The operator $T_0$ will be called a solvable generalized Friedrichs extension
(s.g.F.e.) of T. For K = I, Theorem 1.1 furnishes the selfadjoint extension of
a symmetric positive definite operator constructed by Friedrichs [7, 25]. In
case K is continuously invertible, Theorem 1.1 reduces to a similar theorem
in [26].

Remark 1. T may still have other K-p.d. extensions. But among these
extensions there is only one, the operator $T_0$ we have just constructed,
whose domain is contained in $H_0$ and which in some sense is maximal [26];
i.e., if $T'$ is an arbitrary K-p.d. extension of T such that
$D_{T'} \subseteq H_0$, then $T_0 \supseteq T'$.

Remark 2. We complete this section with the proof of a theorem that can be
used in one of the often applied techniques in the investigation of
complicated operator equations, which consists in comparing these equations
with much simpler operator equations whose properties are well known. The
"closeness", defined in some sense, of the two operator equations implies
that they share various important properties such as existence and uniqueness
of solutions, the applicability of various approximate methods and their
convergence, etc. Theorem 1.2 below offers the possibility of introducing a
rather simple principle of comparison and of deriving from it a series of the
above-mentioned properties.

Theorem 1.2. If T is K-p.d. and K is closed with $D_K = D_T$, then there
exists a constant $\theta_1 > 0$ such that

$$\|Tu\| \le \theta_1 \|Ku\|, \quad u \in D_T. \tag{1.11}$$

Furthermore, T is closed, $R_T = R_K = H$, and

$$(\alpha_1/\theta_1)^{1/2}\,\|u\| \le \|Ku\| \le \gamma_2 |u| \le \gamma_2^2 \|Tu\| \le \theta_1 \gamma_2^2 \|Ku\|, \quad u \in D_T. \tag{1.12}$$
Proof. Consider the space $H_0$. By definition of $H_0$,
$H_0 \supseteq D_T$. However, since K is closed it is not hard to see that
$D_K \supseteq H_0$. In fact, let u be any element in $H_0$; then there
exists a sequence $\{u_n\}$ in $D_T = D_K$ such that $|u_n - u| \to 0$ as
$n \to \infty$, whence by (1.5) and (1.6) we see that $\|u_n - u\| \to 0$ and
$\|Ku_n - Ku_m\| \to 0$ as n and m increase indefinitely. Since K is closed,
this implies that $u \in D_K$ and $Ku_n \to Ku$ as $n \to \infty$; hence
$H_0 \subseteq D_K$. This shows that $H_0 = D_K$; i.e., $H_0 = D[T]$. Exactly
as in [26] we show that T, as an operator from all of $H_0$ to H, is closed
and therefore, by the closed-graph theorem [32], is bounded. Hence there
exists a constant $\theta > 0$ such that $\|Tu\| \le \theta |u|$ for all u in
$H_0$. Since $|u|^2 = (Tu, Ku) \le \|Tu\|\,\|Ku\| \le \theta |u|\,\|Ku\|$,
this implies (1.11) with $\theta_1 = \theta^2$.

To prove the second part of Theorem 1.2 we first note that since, by
hypothesis, K is closed and $R_K$ is dense in H and, by (1.11) and Lemma
1.1(a), K has a bounded inverse on $R_K$, we must conclude that $R_K = H$.
That T is closed as an operator in H follows trivially from (1.2), (1.11),
and the properties of K. Therefore, $R_T$ is closed. Thus to show that
$R_T = H$ it is sufficient to show that $Z_{T^*} = \{0\}$. Let
$g \in Z_{T^*}$. Since $Z_{T^*} \subseteq R_K$ ($= H$) and $D_K = D_T$ there
exists $u \in D_K$ such that Ku = g. For such u we have

$$0 = |(u, T^* g)| = |(Tu, Ku)| \ge \alpha_1 \|u\|^2,$$

showing that u = 0 and g = Ku = 0; i.e., $Z_{T^*} = \{0\}$. The inequalities
(1.12) follow immediately from (1.1), (1.2), and (1.11).
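
The step from (1.1), (1.2), and (1.11) to a chain of norm comparisons can be written out. The derivation below is a sketch under the hypotheses of Theorem 1.2, with $\theta_1$ denoting the constant in (1.11) and $\gamma_2 = \sqrt{\alpha_2}$ as in (1.6); the particular grouping of the steps is ours.

```latex
\begin{align*}
|u|^2 = (Tu,Ku) &\le \|Tu\|\,\|Ku\| \le \gamma_2 \|Tu\|\,|u|
  &&\Longrightarrow\quad |u| \le \gamma_2 \|Tu\|, \\
\alpha_1 \|u\|^2 \le (Tu,Ku) &\le \|Tu\|\,\|Ku\| \le \theta_1 \|Ku\|^2
  &&\Longrightarrow\quad \|u\| \le (\theta_1/\alpha_1)^{1/2}\,\|Ku\| .
\end{align*}
```

Combined with $\|Ku\| \le \gamma_2 |u|$ from (1.6) and with (1.11), these give $\|Ku\| \le \gamma_2 |u| \le \gamma_2^2 \|Tu\| \le \theta_1 \gamma_2^2 \|Ku\|$, so that the norms $\|Ku\|$, $|u|$, and $\|Tu\|$ are mutually equivalent on $D_T$.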

Corollary 1.1. If T is K-p.d. and K is closed with $D_K = D_T$, then the
operators T and K form an acute angle.(5)

(5) Following Sobolevsky [34] we say that two densely defined operators P and
R form an acute angle if $D_P = D_R$,
$(Pu, Ru) \ge \delta \|Pu\|\,\|Ru\|$ for all u in $D_P$ and some
$\delta > 0$, and they vanish only at the zero element. Later on we will also
use this definition in a slightly more extended sense in which the above
inequality is replaced by the more general one:
$|(Pu, Ru)| \ge \delta \|Pu\|\,\|Ru\|$, $u \in D_P$, $\delta > 0$.
Proof. The proof follows from (1.2) and Theorem 1.2. In fact,

$$(Tu, Ku) \ge \delta \|Tu\|\,\|Ku\|, \quad \delta = \frac{1}{\alpha_2 \theta_1}, \quad u \in D_T. \tag{1.12'}$$

The consequences derived from Theorem 1.2 and Corollary 1.1 will be discussed
in Section 1.4.

1.2. Extensions of non-K-p.d. operators. In this section we prove a theorem
concerning s.g. Friedrichs extensions for a much more general class of
nonsymmetric operators and then, by specializing the operators, we derive
from it some of the results of Lax and Milgram [21] under slightly less
restrictive conditions. At the same time we derive a slight generalization of
the corresponding results in [26].

Theorem 1.3. Let T be K-p.d. and L be an operator with $D_L = D_T$. If

$$|(Lu, Ku)| \ge \eta_1 |u|^2, \quad \eta_1 > 0, \quad u \in D_L, \tag{1.13}$$

and

$$|(Lu, Kv)| \le \eta_2 |u|\,|v|, \quad \eta_2 > 0, \quad u, v \in D_L, \tag{1.14}$$

then

(a) L has a s.g.F.e. $L_0$ such that $L_0$ is closed, $L_0 \supseteq L$, and
$L_0$ is continuously invertible;

(b) $L_0 = T_0 W_0$, where $W_0$ is the extension of the operator
$T_0^{-1} L$ to $H_0$.

Proof. To prove (a) observe that, by Lemma 1.2, (Lu, Kv) is a bounded
conjugate linear functional of v in $H_0$ for each u in $D_L = D_T$. Hence
there exists a linear operator W in $H_0$ with domain $D_W = D[T]$ such that
for all v in $H_0$ and u in $D_W$

$$(Lu, Kv) = [Wu, v]. \tag{1.15}$$

By (1.14) the operator W is bounded, while by (1.13) it has a bounded inverse
$W^{-1}$ on $R_W \subseteq H_0$. This and the denseness of $D_W$ in $H_0$
show that W possesses the unique bounded closure $\bar{W}$ defined on all of
$H_0$ such that $R_{\bar{W}} = \overline{R_W}$ and $\bar{W}$ has a bounded
inverse $\bar{W}^{-1}$ on $R_{\bar{W}}$. Furthermore, this and (1.13) imply
that $R_{\bar{W}} = H_0$. In fact, if we denote by $Z_{\bar{W}^*}$ the zero
manifold of $\bar{W}^*$, then since
$R_{\bar{W}} \oplus Z_{\bar{W}^*} = H_0$ our assertion will follow if we show
that $Z_{\bar{W}^*} = \{0\}$. Let $g \in Z_{\bar{W}^*}$; then by (1.15),
(1.13), and the continuity of the inner product and the operator $\bar{W}$ we
have for this g

$$0 = |[g, \bar{W}^* g]| = |[\bar{W} g, g]| \ge \eta_1 |g|^2.$$

This shows that g = 0 and $R_{\bar{W}} = H_0$.

To obtain the extension $L_0$ we define the generalized solution of

$$Lu = f, \quad f \in H, \tag{1.16}$$

to be an element u in $H_0$ such that for all v in $H_0$

$$(f, Kv) = [\bar{W} u, v]. \tag{1.17}$$

It is evident from (1.15) that every ordinary solution of (1.16) is also its
generalized solution, but the converse need not be true. Moreover, for any f
in H there exists a unique generalized solution of (1.16) which, in view of
(1.10), must satisfy the relation

$$[\bar{W} u, v] = [T_0^{-1} f, v] \tag{1.18}$$

for all v in $H_0$ and, consequently, the equation

$$\bar{W} u = T_0^{-1} f. \tag{1.19}$$

It is evident from (1.18) and (1.10) that u determined by

$$u = \bar{W}^{-1} T_0^{-1} f \tag{1.20}$$

is a generalized solution of (1.16). Furthermore, it is unique. For if
$u^*$ is any other generalized solution of (1.16), then
$[\bar{W} u, v] = [\bar{W} u^*, v]$ or $[\bar{W}(u - u^*), v] = 0$ for all v
in $H_0$ and, in particular, for $v = u - u^*$. This together with (1.13),
(1.15), and the continuity of the inner product implies that $u = u^*$.

The equation (1.19) suggests that we take $L_0$ as

$$L_0 = T_0 \bar{W}. \tag{1.21}$$

Following the same arguments as in [26] one shows that $L_0$ so defined has
all the properties specified in (a) with

$$L_0^{-1} = \bar{W}^{-1} T_0^{-1}. \tag{1.22}$$
To prove (b) let us first note that the operator $T_0^{-1} L$ defined over
$D_L$ is bounded in the $H_0$-norm. Indeed, if for any u in $D_L$ we put
$v = T_0^{-1} L u$, then by (1.14)

$$|v|^2 = |T_0^{-1} L u|^2 = |(Lu, Kv)| \le \eta_2 |u|\,|v|.$$

This shows that $T_0^{-1} L$ is bounded in $H_0$. Let $W_0$ be its unique
extension to $H_0$. It follows from (a) that to show that $L_0 = T_0 W_0$ it
is sufficient to show that $W_0 = \bar{W}$ or, equivalently, that
$T_0^{-1} L = W$. The latter equality follows, however, from the definition
of W in (1.15) and the identity (1.18) in which f is any element of the form
Lu with u in $D_L$. This completes the proof of Theorem 1.3.

Remark 3. Theorem 1.3(a) remains valid if instead of the inequality (1.14) we assume that $L$ is K-real for all $u$ in $D_L$ and has the property that $(Lu, Kv) = 0$, $u$ in $D_L$, implies $v = 0$. In this case $W$ determined by (1.15) is symmetric and, though no longer bounded in $H_0$, has a bounded inverse $W^{-1}$ defined on the range $R_W$, which is dense in $H_0$. The s.g.F.e. of $L$ is given by $L_0 = T_0 W_0$, where $W_0$ is the trivial extension of $W$ in $H_0$.

Corollary 1.2.(6) If $T$ is K-p.d. and $L = T + S$, where $S$ is an operator over $D_T$ such that for all $u$ in $D_T$

$$|(Lu, Ku)| \ge \eta_3 |u|^2, \qquad \eta_3 > 0, \tag{a}$$

and

$$|(Su, Kv)| \le \eta_4 |u|\,|v|, \qquad \eta_4 > 0, \tag{b}$$

then

$$L_0 = T_0(I + N_0), \tag{1.23}$$

where $N_0$ is the extension of $T_0^{-1}S$ to $H_0$.

(6) Corollaries 1.2 and 1.3 were first proved for real space $H$ by Lax and Milgram [21] for the case when $K = I$ and Condition (a) is $(Lu, u) \ge \eta_3 |u|^2$.

Proof. Corollary 1.2 is an immediate consequence of Theorem 1.3 for, in view of Lemma 1.1 and Conditions (a) and (b), the operator $L = T + S$ satisfies (1.13) with $\eta_1 = \eta_3$ and (1.14) with $\eta_2 = 1 + \eta_4$. Hence, by Theorem 1.3, $L$ has a s.g.F.e. $L_0 = T_0 W_0$, where $W_0$ is the extension of $T_0^{-1}L$ to $H_0$. Now $T_0^{-1}L = T_0^{-1}(T + S) = I + T_0^{-1}S$ on $D_T$ and, by (b), the operator $T_0^{-1}S$, being defined over $D_T$, is bounded in the $H_0$-norm and, therefore, can be uniquely extended to all of $H_0$. Let $N_0$ be this extension. Then $I + N_0$ is also an extension of $T_0^{-1}L$ to $H_0$. Since this extension is unique we must conclude that $W_0 = I + N_0$; i.e., $L_0 = T_0(I + N_0)$.

Corollary 1.3. If $T$ is K-p.d. and $L = T + S$ is such that

$$|(Lu, Ku)| \ge \eta_3 |u|^2, \qquad u \in D_T, \tag{a}$$

$$\|Su\| \le \eta_4 |u|, \qquad \eta_4 > 0, \quad u \in D_T, \tag{c}$$

then $D_{L_0} = D_{T_0}$ and $L_0 = T_0 + \bar{S}$, where $\bar{S}$ is the extension of $S$ to $H_0$.

Proof. As before we see that $L = T + S$ satisfies (1.13) and (1.14) with $\eta_1 = \eta_3$ and $\eta_2 = 1 + \eta_4\gamma_2$ and $L_0 = T_0 W_0$. Furthermore, (c) implies that $S$ can be extended by closure to $\bar{S}$ defined on all of $H_0$ and that $T_0^{-1}S$, being bounded over $D_T$ in the $H_0$-norm, can be extended to all of $H_0$. Since the extension of $T_0^{-1}S$ is unique it must be equal to $T_0^{-1}\bar{S}$ and, consequently, $W_0 = I + T_0^{-1}\bar{S}$. This shows that $L_0 = T_0 W_0 = T_0(I + T_0^{-1}\bar{S}) \supseteq T_0 + \bar{S}$; i.e., if $u \in D_{T_0}$, then $u \in D_{L_0}$ and $L_0 u = (T_0 + \bar{S})u$.




To complete the proof we need still to show that $D_{L_0} \subseteq D_{T_0}$ and $L_0 u = T_0 u + \bar{S}u$ for all $u$ in $D_{L_0}$. Now if $f \in H$, then $u = L_0^{-1}f = W_0^{-1}T_0^{-1}f$ is such that

$$(f, Kv) = [T_0^{-1}f, v] = [W_0 u, v] = [(I + T_0^{-1}\bar{S})u, v] = [u, v] + (\bar{S}u, Kv), \qquad v \in H_0,$$

i.e., $[u, v] = (f - \bar{S}u, Kv) = [T_0^{-1}(f - \bar{S}u), v]$ or $u = T_0^{-1}(f - \bar{S}u)$. This shows that $D_{L_0} \subseteq D_{T_0}$ and that $L_0 u = T_0 u + \bar{S}u$, $u \in D_{L_0}$. Together they imply the assertion of Corollary 1.3.

We complete this section by proving a theorem which is a generalization of Theorem 1.2 to non-K-p.d. operators and for which the comments expressed in Remark 2 are applicable.

Theorem 1.4. Let $T$ be K-p.d. and $K$ closed with $D_K = D_T$. If $L$ is a closeable operator for which (1.13) is satisfied for all $u$ in $D_L = D_T$, then $L$ has a bounded inverse $L^{-1}$ on $R_L$ and

^itJ|u||    <  £ju|    <    ||lu||    <   g^M    , 

(1.24) 

./    "i 

u    c    D 

1j 


Furthermore, $L$ is closed and $R_L = R_K = H$.

Proof.      Since    ||Lu||||ku||    >   |  (Lu,  Ku)  |   >  n    \u\^ ,     (1, 12)  implies  that 

jJL.uj|    >J^,|uj   >J^,T-j|ujj,     i.e.,     L         exists  and  is  bounded  on    R    .      By 

111  Lj 

Theorem 1.2, $H_0 = D[T]$ and, since $L$ is closeable in $H$, it is not hard to see that $L$, as an operator from all of $H_0$ to $H$, is closed. Hence there exists $\theta > 0$ such that $\|Lu\| \le \theta |u|$. This proves the first assertion of Theorem 1.4. Using the above inequality (1.24) and repeating the arguments of the proof of Theorem 1.2 we obtain the validity of the second assertion of Theorem 1.4.

Corollary  1.  4.      If    L    satisfies  the  conditions  of  Theorem  1.4,   then    L 
and    K    form  an  acute  angle  with 

7, 


(1.25)  |(Lu,Ku)|     >     6J|Lu||  1|Ku||     ,         6^    =    — ^    <     1,        u  c  Dj_^    . 
Let us remark that when $L = T$ then $L$ is K-p.d. with $D_L = D_T$ and $\eta_1 = 1$ so that in this case Theorem 1.4 and Corollary 1.4 reduce to Theorem 1.2 and Corollary 1.1, respectively.

1.3. Projectional methods. In this section we indicate briefly the usefulness of Theorem 1.2 and Theorem 1.4 by combining them with the recent results obtained by Polsky [28] and proving thus some of the assertions expressed in Remark 2.

A sequence of closed subspaces $M_n$ of $H$ will be called projectionally complete [28] if $\|f - P_n f\| \to 0$, as $n \to \infty$, for every $f$ in $H$, where $P_n$ is an orthogonal projection of $H$ onto $M_n$. Consider the operator equation

$$Lu = f, \qquad f \in H, \tag{1.26}$$

where the operator $L$ satisfies the conditions of Theorem 1.4. Let $\{R_n\}$ be a sequence of finite dimensional subspaces of $D_L = D_K \subset H$ such that the sequences $\{L_n\} = \{LR_n\}$ and $\{M_n\} = \{KR_n\}$ are projectionally complete in $H$; let $P_n$ and $\Pi_n$ denote the orthogonal projectors of $H$ on $M_n$ and $L_n$, respectively. Then according to the projectional method [28](7) the approximate solution $u_n \in R_n$ is determined from the condition

$$P_n L u_n = P_n f. \tag{1.27}$$
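As a minimal finite-dimensional sketch of the projectional condition (1.27), one may take $H$ to be the real plane and $R_1$ the span of a single vector. The matrices `L` and `K`, the vector `phi`, and the right-hand side `f` below are hypothetical illustration data, not taken from the paper. With $n = 1$ the condition reduces to $(Lu_1 - f, K\varphi) = 0$, and the cosine of the angle between $L\varphi$ and $K\varphi$ plays the role of the acute-angle constant of Corollary 1.4. The final bound is the elementary estimate available in this one-dimensional setting; it is shown only to illustrate the role of the angle and is not a restatement of the estimates of [28].

```python
import math

def matvec(A, x):
    """Matrix-vector product for small dense matrices stored as lists of rows."""
    return [sum(a * xi for a, xi in zip(row, x)) for row in A]

def dot(x, y):
    return sum(a * b for a, b in zip(x, y))

def norm(x):
    return math.sqrt(dot(x, x))

# Hypothetical data (assumption): a nonsymmetric L and a nontrivial K.
L = [[2.0, 1.0], [-1.0, 2.0]]
K = [[1.0, 1.0], [0.0, 1.0]]
phi = [1.0, 0.0]          # R_1 = span{phi}
f = [1.0, 1.0]

L_phi = matvec(L, phi)    # spans L_1 = L R_1
K_phi = matvec(K, phi)    # spans M_1 = K R_1

# Condition (1.27) with n = 1: (L u_1 - f, K phi) = 0 for u_1 = a * phi.
a = dot(f, K_phi) / dot(L_phi, K_phi)
u1 = [a * p for p in phi]
residual = [fi - li for fi, li in zip(f, matvec(L, u1))]

# Cosine of the angle between L phi and K phi (positive means an acute angle).
cos_angle = abs(dot(L_phi, K_phi)) / (norm(L_phi) * norm(K_phi))

# Distance from f to span{L phi}, i.e. the norm of f minus its orthogonal projection.
c = dot(f, L_phi) / dot(L_phi, L_phi)
best = [fi - c * li for fi, li in zip(f, L_phi)]

# Elementary one-dimensional bound on the residual in terms of the angle.
bound = (1.0 + 1.0 / cos_angle) * norm(best)
```

The smaller the cosine, the weaker the bound, which is why a uniform lower bound on the angle between $LR_n$ and $KR_n$ matters for convergence.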

Theorem 1.5. If $L$ satisfies the conditions of Theorem 1.4 and the sequence $\{R_n\} \subset D_L$ is such that $\{L_n\}$ and $\{M_n\}$ are projectionally complete in $H$, then

(a) For every $f$ in $H$, the equation (1.26) has a unique solution $u \in D_L$.

(b) The approximate equation (1.27) is uniquely solvable for every $f$ in $H$ if and only if $\tau_n > 0$, where

$$\tau_n = \min_{v \in L_n,\ \|v\| = 1} \|P_n v\|.$$

(c) The error vectors $w_n = u - u_n$ and the residual vectors $r_n = Lu - Lu_n$ converge to zero for any $f \in H$ if and only if the condition (A): $\tau > 0$, where $\tau = \lim_n \tau_n$, is satisfied. If this is the case, then the error estimates are given by

$$\|Lu - Lu_n\| \le \left(1 + \frac{1}{\tau_n}\right)\|f - \Pi_n f\| \tag{1.28}$$

and

$$\|u - u_n\| \le \|L^{-1}\|\left(1 + \frac{1}{\tau_n}\right)\|f - \Pi_n f\|, \tag{1.29}$$

where $u$ is the exact solution of (1.26) and $u_n$ an approximate solution of (1.27).

(7) For a different formulation of a family of direct methods which also utilizes the concept of a projection see author's paper [26]. In some cases the two formulations are essentially equivalent.

Proof. The proof of Theorem 1.5 is essentially based on Theorem 1.4 and Theorem 1 in [28]. In fact, (a) follows directly from Theorem 1.4. To prove (b) and (c) it is sufficient, in view of results obtained in [28], to show that our conditions on $L$ imply that $\tau_n > 0$ and $\tau = \lim_n \tau_n > 0$.

Let $v$ be an arbitrary element in $L_n$ and $w$ in $M_n$ such that $\|v\| = 1$ and $\|w\| = 1$; put $v = Lu$, $u \in R_n$. Then, by Corollary 1.4,

$$\|P_n v\| = \max_w |(v, w)| = \max_w \left|\left(\frac{Lu}{\|Lu\|}, w\right)\right| \ge \left|\left(\frac{Lu}{\|Lu\|}, \frac{Ku}{\|Ku\|}\right)\right| \ge \delta_1.$$

Consequently, $\tau_n = \min_v \|P_n v\| \ge \delta_1$, and $\tau = \lim_n \tau_n \ge \delta_1$. Q.E.D.

Corollary 1.5. If $T$ is K-p.d. and $K$ closed with $D_K = D_T$, then for the equation $Tu = f$, $f \in H$, the assertions of Theorem 1.5 remain valid.


1.4. $H_0$-bounded operators. In later discussion we shall also have the opportunity to deal with closeable operators $S$ such that $D_S \supseteq H_0$. We call such operators $H_0$-bounded.(8) If $S$ is $H_0$-bounded, then by $\underline{S}$ we denote the operator from $H_0$ to $H$ defined by $\underline{S}u = Su$. The introduced definition is explained by

Lemma 1.4. If $S$ is $H_0$-bounded, then $\underline{S}$ is bounded and consequently there exists a constant $\eta_5 > 0$ such that

$$\|\underline{S}u\| \le \eta_5 |u|, \qquad u \in H_0. \tag{1.30}$$

(8) If $K$ is closed and $D_K = D_T$, then in view of Theorem 1.2, the concepts of $H_0$-boundedness and $H_0$-compactness of $S$ reduce to the concepts of T-boundedness and T-compactness introduced by Gokhberg and Krein [9].

Proof. To prove Lemma 1.4 we observe that, in view of the closed-graph theorem and the fact that $\underline{S}$ is defined over all of the complete space $H_0$, it is sufficient to show that $\underline{S}$ admits in $H_0$ a closed extension, i.e., whenever $\{u_n\}$ is a sequence in $H_0$ and $f$ an element in $H$ such that $|u_n| \to 0$ and $\|\underline{S}u_n - f\| \to 0$ as $n \to \infty$, then $f = 0$. By (1.1) and the definition of $\underline{S}$ we see that $u_n \to 0$ in $H$ and $Su_n \to f$, as $n \to \infty$. Since $S$ is closeable, $f = 0$. Thus, $\underline{S}$ is closed in $H_0$. Being everywhere defined in $H_0$, it is bounded, i.e., there exists a constant $\eta_5 > 0$ such that (1.30) holds.

Let us note in passing that when $S$ satisfies the inequality (1.30) for all $u$ in $D_T$ then $S$ considered as a transformation from $D[T]$ to $H$ can be extended to $\underline{S}$ mapping all of $H_0$ into $H$ but, in general, $\underline{S}$ need not be $H_0$-bounded for it may fail to be closeable as an operator in $H$.

Lemma 1.5. If $S$ is $H_0$-bounded, then $ST_0^{-1}$ is bounded in $H$.

Proof. Since $ST_0^{-1}$ is defined everywhere in $H$ it is evident that to prove its boundedness in $H$ it is sufficient to show that $ST_0^{-1}$ is closeable in $H$. Suppose $u_n \to 0$ and $ST_0^{-1}u_n \to f$, as $n \to \infty$, for some $f$ in $H$. Then $v_n = T_0^{-1}u_n \to 0$ and $Sv_n = ST_0^{-1}u_n \to f$ as $n \to \infty$. But $S$ is closeable; hence $f = 0$. Thus, since $ST_0^{-1}$ is closeable and defined on all of $H$, it is closed and therefore bounded.

An operator $S$ will be called $H_0$-compact if $S$ is $H_0$-bounded and the corresponding operator $\underline{S}$ is compact; i.e., if for every set $Q = \{u : u \in H_0,\ |u| \le m\}$ the image $\underline{S}Q$ is compact in $H$. As will be seen later such operators are of great importance in proving the existence of eigenvectors and the applicability of various approximate methods. The two theorems to be proved below offer us various possibilities and conditions under which the operator $T_0^{-1}S$ is compact in $H_0$. This, as we shall see later, will prove useful in the application to differential equations and eigenvalue problems.

Theorem 1.6. The assertion (a): $S$ is $H_0$-compact $\Rightarrow$ (b): $T_0^{-1}S$ is compact in $H_0$ and (c): $ST_0^{-1}$ is compact in $H$. Furthermore, (b) $\Rightarrow$ (c). If in addition we assume that $K$ is closed and $D_K = D_T$, then

(a) $\Leftrightarrow$ (b) $\Leftrightarrow$ (c).

Proof. (a) $\Rightarrow$ (b). Let $Q_0$ be an arbitrary bounded set in $H_0$. Since by Theorem 1.1 the operator $T_0^{-1}$, considered as a transformation from $H$ to $H_0$, is bounded it maps the compact set $\underline{S}Q_0$ in $H$ onto the compact set $T_0^{-1}\underline{S}Q_0 = T_0^{-1}SQ_0$ in $H_0$. This implies that $T_0^{-1}S$ is compact in $H_0$.

(a) $\Rightarrow$ (c). Let $Q$ be a bounded set in $H$; then its image set $Q_0 = T_0^{-1}Q$ is bounded in $H_0$. This and the relation

$$ST_0^{-1}Q = \underline{S}Q_0 \tag{1.31}$$

show that the compactness of $\underline{S}Q_0$ in $H$ implies the same for the set $ST_0^{-1}Q$, i.e., $ST_0^{-1}$ is compact in $H$.


(b) $\Rightarrow$ (c). Let $\{f_n\}$ be any sequence in $H$ converging weakly to some element $f$ of $H$; then its image sequence $u_n = T_0^{-1}f_n$ converges weakly in $H_0$ to $u = T_0^{-1}f \in H_0$ because for any $v$ in $H_0$

$$[u_n, v] = [T_0^{-1}f_n, v] = (f_n, Kv) \to (f, Kv) = [T_0^{-1}f, v] = [u, v].$$

Let $f$ and $g$ be weak limits in $H$ of $\{f_n\}$ and $\{g_n\}$, respectively. Then $u = T_0^{-1}f$ and $v = T_0^{-1}g$ are the weak limits in $H_0$ of $\{u_n\} = \{T_0^{-1}f_n\}$ and $\{v_n\} = \{T_0^{-1}g_n\}$, respectively, and

$$[T_0^{-1}Su_n, v_n] = (Su_n, Kv_n) = (ST_0^{-1}f_n, KT_0^{-1}g_n). \tag{1.32}$$

Since, by Lemma 1.5, $KT_0^{-1}$ is bounded in $H$ the relation (1.32) shows that the compactness of $T_0^{-1}S$ in $H_0$ implies the compactness of $ST_0^{-1}$ in $H$.

To prove the third part of the theorem we need only to show that (c) $\Rightarrow$ (a). In view of our hypothesis, this is an immediate consequence of Theorem 1.2 and the relation (1.31) for in that case $H_0 = D[T]$ and the inequalities (1.11) and (1.12) show that not only is the image $Q_0 = T_0^{-1}Q$ of a bounded set $Q$ in $H$ bounded in $H_0$ but also the image $TQ_0$ of a bounded set $Q_0$ in $H_0$ is bounded in $H$. This and (1.31) imply that if any one of the sets $ST_0^{-1}Q$, $\underline{S}Q_0$ is compact, then so is the other.

Theorem 1.7. If $S$ is $H_0$-bounded and $KT_0^{-1}$ is compact in $H$, then $T_0^{-1}S$ is compact in $H_0$.

Proof. Let $Q_0$ be any bounded set in $H_0$, i.e., $|u| \le M$, $u \in Q_0$. Since $S$ is $H_0$-bounded, Lemma 1.4 shows that there exists a constant $\eta_5 > 0$ such that

$$\|Su\| \le \eta_5 |u| \le \eta_5 M, \qquad u \in Q_0. \tag{1.33}$$

This and the compactness of $KT_0^{-1}$ in $H$ enable us to extract from $Q_0$ a sequence of elements $\{u_n\}$ such that

$$\lim_{n, m} \|KT_0^{-1}S(u_n - u_m)\| = 0. \tag{1.34}$$

Thus, by (1.4) and (1.33),

$$|T_0^{-1}S(u_n - u_m)|^2 = (S(u_n - u_m), KT_0^{-1}S(u_n - u_m)) \le \|S(u_n - u_m)\|\,\|KT_0^{-1}S(u_n - u_m)\| \le 2\eta_5 M \|KT_0^{-1}S(u_n - u_m)\|,$$

from which, on account of (1.34), we conclude that $|T_0^{-1}S(u_n - u_m)| \to 0$ as $n$ and $m$ increase indefinitely. Hence the operator $T_0^{-1}S$ is compact in $H_0$.

1.5. Error estimate of the generalized Ritz method. We complete this chapter by using the above results in the brief discussion of the error estimate of the generalized Ritz method.

It was shown in Lemma 1.3 and Theorem 1.1 that $F(u)$ attains its minimum value $d$ at $w = T_0^{-1}f$ and that for any other $u$ in $D_T$ the deviation of $F(u)$ from $d$ measures the distance from $u$ to $w$ by means of formula (1.8): $|u - w|^2 = F(u) - d$. This shows that a sequence of elements $\{u_n\}$ converges to $w$ if and only if $F(u_n)$ converges to $d$. This is the basis of the Ritz method which does effectively produce a sequence $\{u_n\}$ in $D_T$ such that $|u_n - w|^2 = F(u_n) - d \to 0$, as $n \to \infty$. However, formula (1.8) does not offer us the means to estimate the error $|u_n - w|$ unless we know the lower bound for $d$.


To obtain the error estimate it seems more convenient to regard the generalized Ritz method as a special kind of a projection method [26] according to which we take the approximate solution $u_n$ in the form

$$u_n = \sum_{i=1}^{n} a_i \varphi_i, \tag{1.35}$$

where $\varphi_i$, $i = 1, 2, \ldots$, is a system of linearly independent elements in $D_T$, complete in $H_0$, and determine the coefficients $a_1, \ldots, a_n$ from the condition that

$$u_n = P_n w = P_n T_0^{-1}f, \tag{1.36}$$

where $P_n$ is the orthogonal projection of $H_0$ onto the subspace $H_n$ determined by $\varphi_1, \ldots, \varphi_n$. The condition (1.36) leads to the algebraic system

$$\sum_{i=1}^{n} (T\varphi_i, K\varphi_j)\,a_i = (f, K\varphi_j), \qquad j = 1, 2, \ldots, n, \tag{1.37}$$

which, in view of linear independence of $\varphi_i$ in the $H_0$-metric, is uniquely solvable for each $f$. It is now easy to prove

Theorem 1.8. Let $S$ be a continuously invertible operator with $D_{T_0} \subset D_S \subset H_0$. If $T_0^{-1}S$ can be extended to a compact operator in $H_0$, then for any $\varepsilon > 0$ there exists an integer $n_0$ such that for all $n \ge n_0$ and any $f$ in $H$ the error estimate for the generalized Ritz method is given by

$$|u_n - w| \le \varepsilon |S^{-1}f|, \tag{1.38}$$

where $w$ is the exact solution of (1.3a) and $u_n$ is its approximation determined by (1.35)-(1.37).

Proof. Let $N$ denote the extension of $T_0^{-1}S$ to all of $H_0$. Since $N$ is compact in $H_0$ it is known [32] that for any $\varepsilon > 0$ there exists an $n_0$ such that for all $n \ge n_0$

$$|N - P_n N| \le \varepsilon. \tag{1.39}$$

Hence from (1.39), (1.36), and the property of $N$ we obtain (1.38). In fact,

$$|w - u_n| = |T_0^{-1}f - P_n T_0^{-1}f| = |(T_0^{-1}S - P_n T_0^{-1}S)S^{-1}f| = |(N - P_n N)S^{-1}f| \le |N - P_n N|\,|S^{-1}f| \le \varepsilon |S^{-1}f|,$$

which is precisely the error estimate (1.38).
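The algebraic system (1.37) of the generalized Ritz method can be sketched in a finite-dimensional setting. The code below is only an illustration under stated assumptions: $H$ is taken to be real three-dimensional space, the matrices `T` and `K` are hypothetical data chosen so that $K^{\mathsf T}T$ is symmetric and positive definite (a matrix analogue of a K-p.d. operator), and the helper names `solve` and `ritz` are introduced here and do not come from the paper.

```python
def matvec(A, x):
    """Matrix-vector product for small dense matrices stored as lists of rows."""
    return [sum(a * xi for a, xi in zip(row, x)) for row in A]

def dot(x, y):
    return sum(a * b for a, b in zip(x, y))

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(c + 1, n):
            m = M[r][c] / M[c][c]
            for k in range(c, n + 1):
                M[r][k] -= m * M[c][k]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(M[r][k] * x[k] for k in range(r + 1, n))
        x[r] = (M[r][n] - s) / M[r][r]
    return x

def ritz(T, K, f, basis):
    """Assemble and solve sum_i (T phi_i, K phi_j) a_i = (f, K phi_j), j = 1..n."""
    n = len(basis)
    T_phi = [matvec(T, p) for p in basis]
    K_phi = [matvec(K, p) for p in basis]
    G = [[dot(T_phi[i], K_phi[j]) for i in range(n)] for j in range(n)]
    rhs = [dot(f, K_phi[j]) for j in range(n)]
    a = solve(G, rhs)
    # u_n = sum_i a_i phi_i
    return [sum(a[i] * basis[i][k] for i in range(n)) for k in range(len(f))]

# Hypothetical data (assumption): T is nonsymmetric, K^T T is symmetric positive definite.
T = [[2.0, 1.0, 0.0], [-1.0, 2.0, 1.0], [0.0, 1.0, 2.0]]
K = [[1.0, 1.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
f = [1.0, 2.0, 3.0]

basis2 = [[1.0, 0.0, 0.0], [0.0, 1.0, 1.0]]
u2 = ritz(T, K, f, basis2)                      # approximation from a 2-dimensional subspace
r2 = [ti - fi for ti, fi in zip(matvec(T, u2), f)]

basis3 = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
u3 = ritz(T, K, f, basis3)                      # full basis: recovers the exact solution
```

In this sketch the residual `r2` is orthogonal to each $K\varphi_j$, which is the matrix form of the projection condition (1.36), and with a full basis the Ritz approximation coincides with the solution of $Tu = f$.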

Corollary 1.6. Theorem 1.8 remains valid if $S$ is continuously invertible with $D_S = H_0$ and either $S$ is $H_0$-compact or $KT_0^{-1}$ is compact in $H$.

Proof. The proof of Corollary 1.6 follows from Theorem 1.6, Theorem 1.7, and Theorem 1.8.

Corollary 1.7.(9) If $T_0$ is a selfadjoint and positive definite operator with $T_0^{-1}$ compact in $H$, then Theorem 1.8 remains valid for $K = I$ and $S = T_0^{1/2}$ with

$$|u_n - w| \le \varepsilon \|f\|. \tag{1.38'}$$

Proof. It is known [25] that $D_S = D_{T_0^{1/2}} = H_0$. Hence, by Theorem 1.7, $T_0^{-1}S = T_0^{-1/2}$ is compact in $H_0$ and

$$|S^{-1}f|^2 = |T_0^{-1/2}f|^2 = (T_0 T_0^{-1/2}f, T_0^{-1/2}f) = (T_0^{1/2}T_0^{-1/2}f, T_0^{1/2}T_0^{-1/2}f) = \|f\|^2.$$

Corollary 1.7 is now an immediate consequence of Theorem 1.8.

Remark 4. In Chapters 2, 3, and 4 whenever we consider the unbounded K-p.d. operators it will be assumed that, if necessary, these operators have been already extended.


(9) The assertion of Corollary 1.7 for an ordinary Ritz method was recently proved by Stadnikova [35].

2. The Eigenvalue Problem $Tu - \lambda Su = 0$

In this chapter an attempt is made to present a theoretical basis for the general discussion in the next chapter of an iterative method for the approximate solution of the eigenvalue problem (2.1). Here we derive some of the general properties of the eigenvalues and eigenelements of (2.1) and also consider the problem of their existence under various conditions. Since some theorems represent generalizations of the corresponding theorems for the selfadjoint eigenvalue problems considered in [25], the details of some parts of their proofs could have been omitted. However, for the convenience of the reader and because of the relative inaccessibility of the monograph [25], the proofs are given in full.

The results in this chapter represent an extension to the eigenvalue problem (2.1) of the corresponding results for the selfadjoint differential and abstract eigenvalue problems obtained in [5, 8, 10, 12, 14, 18, 25, 36]. At the same time we also obtain some additional results.

2.1. General remarks. Let us consider now the eigenvalue problem in $H$

$$Tu - \lambda Su = 0, \tag{2.1}$$

where $T$ is K-p.d., $S$ is an operator with $D_S \supseteq D_T$, and $\lambda$ is a complex number. A value of $\lambda$ for which (2.1) has a nontrivial solution $u$ in $D_T$ will be called an eigenvalue of (2.1) and $u$ its corresponding eigenelement. The set of all eigenvalues of (2.1) will be denoted by $p\sigma(2.1)$ and called the point spectrum or discrete spectrum of (2.1). The number of linearly independent eigenelements $u$ belonging to the same eigenvalue $\lambda$ will be called the multiplicity of $\lambda$.

Let us assume for the present that $S$ is K-real; i.e., $(Su, Ku)$ is real for all $u$ in $D_T$. By Lemma 1.1, $S$ is K-symmetric on $D_T$.

Lemma 2.1. If $T$ is K-p.d. and $S$ is K-real, then

(a) All eigenvalues $\lambda$ of (2.1) are real.

(b) The eigenvectors $w$ and $w'$ belonging to distinct eigenvalues $\lambda$ and $\lambda'$ are orthogonal in the sense that $(Tw, Kw') = 0$ and $(Sw, Kw') = 0$.

(c) The set $p\sigma(2.1)$ contains at most countably many eigenvalues.

Proof. (a) Let $\lambda$ be an eigenvalue of (2.1) and $w$ its corresponding eigenelement. Then $Tw = \lambda Sw$ and hence $(Tw, Kw) = \lambda(Sw, Kw)$. This and (1.1) imply that $(Sw, Kw) \ne 0$ and therefore that

$$\lambda = \frac{(Tw, Kw)}{(Sw, Kw)}. \tag{2.2}$$

The relation (2.2) shows that the eigenvalue $\lambda$ is real and that it can be expressed in terms of the corresponding eigenelement. Formula (2.2) will be very useful in our later discussion.

(b) Let $w$ be an eigenelement of (2.1) with eigenvalue $\lambda$ and $w'$ an eigenelement with eigenvalue $\lambda' \ne \lambda$. Then, in view of Lemma 1.1,

$$0 = (Tw, Kw') - (Kw, Tw') = \lambda(Sw, Kw') - \lambda'(Kw, Sw') = (\lambda - \lambda')(Sw, Kw')$$

and

$$0 = (Sw, Kw') - (Kw, Sw') = \lambda^{-1}(Tw, Kw') - \lambda'^{-1}(Kw, Tw') = (\lambda^{-1} - \lambda'^{-1})(Tw, Kw').$$

Since $\lambda \ne \lambda'$, the above equalities imply the validity of (b).

(c) If to each eigenvalue of (2.1) we associate its corresponding eigenelements normalized in the metric (1.4), then the totality of these eigenelements must constitute an orthonormal set in the separable Hilbert space $H_0$. Consequently, the set $p\sigma(2.1)$ can at most be denumerably infinite.
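The assertions of Lemma 2.1 can be checked on a small matrix example. The following sketch is an illustration only, under assumptions that are not in the text: $H$ is the real plane, and the matrices `T`, `S`, `K` are hypothetical data chosen so that $K^{\mathsf T}T$ is symmetric positive definite and $K^{\mathsf T}S$ is symmetric. In a real space K-reality holds trivially, so the K-symmetry supplied by Lemma 1.1 in the complex case is imposed here directly. The names `eigenvector` and `quotient` are introduced for this sketch.

```python
import math

def matvec(A, x):
    """Matrix-vector product for small dense matrices stored as lists of rows."""
    return [sum(a * xi for a, xi in zip(row, x)) for row in A]

def dot(x, y):
    return sum(a * b for a, b in zip(x, y))

def det2(A):
    return A[0][0] * A[1][1] - A[0][1] * A[1][0]

# Hypothetical data (assumption): T and S are nonsymmetric,
# but K^T T = [[2, 1], [1, 3]] and K^T S = [[1, 0], [0, 2]] are symmetric.
T = [[2.0, 1.0], [-1.0, 2.0]]
S = [[1.0, 0.0], [-1.0, 2.0]]
K = [[1.0, 1.0], [0.0, 1.0]]

# det(T - lam*S) = det(S)*lam^2 - mixed*lam + det(T) for 2x2 matrices.
mixed = (T[0][0] * S[1][1] + T[1][1] * S[0][0]
         - T[0][1] * S[1][0] - T[1][0] * S[0][1])
disc = mixed ** 2 - 4.0 * det2(S) * det2(T)
eigenvalues = sorted((mixed + s * math.sqrt(disc)) / (2.0 * det2(S))
                     for s in (-1.0, 1.0))

def eigenvector(lam):
    """Return a nonzero vector in the null space of T - lam*S."""
    M = [[T[i][j] - lam * S[i][j] for j in range(2)] for i in range(2)]
    row = M[0] if abs(M[0][0]) + abs(M[0][1]) > 1e-12 else M[1]
    return [-row[1], row[0]]

def quotient(u):
    """Formula (2.2): (Tu, Ku) / (Su, Ku)."""
    Ku = matvec(K, u)
    return dot(matvec(T, u), Ku) / dot(matvec(S, u), Ku)

w, w_prime = (eigenvector(lam) for lam in eigenvalues)
cross_T = dot(matvec(T, w), matvec(K, w_prime))   # (Tw, Kw')
cross_S = dot(matvec(S, w), matvec(K, w_prime))   # (Sw, Kw')
```

For this data both eigenvalues are real, each is reproduced by the quotient (2.2) evaluated at its eigenvector, and the two cross terms vanish, in agreement with parts (a) and (b) of the lemma.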

We see from Lemma 2.1 that the eigenvalue problem (2.1) has at most countably many eigenvalues. The problem whether there exist any eigenvalues of (2.1) at all is, in general, a difficult one and will be discussed below for some classes of eigenvalue problems (2.1) in which the operators $T$ and $S$ satisfy some additional conditions.

2.2. K-p.d. eigenvalue problem. Let us assume that there exist constants $\beta_1 > 0$, $\beta_2 > 0$, and $\beta_2' > 0$ such that

$$(Su, Ku) \ge \beta_1 \|u\|^2, \qquad u \in D_T, \tag{2.3}$$

and either

$$\|Ku\|^2 \le \beta_2 (Su, Ku), \qquad u \in D_T, \tag{2.4}$$

or $S$ is closeable and

$$\|Su\|^2 \le \beta_2' (Su, Ku), \qquad u \in D_T. \tag{2.4'}$$

The eigenvalue problem (2.1) in which the operators $T$ and $S$ have the above properties will be called K-p.d. Let $H_1$ be the completion of $D_T$ in the metric

$$[u, v]_1 = (Su, Kv), \qquad |u|_1^2 = [u, u]_1. \tag{2.5}$$

It is not hard to see that under our conditions the space $H_1$ is contained in $H$ in the sense of identifying elements from $H_1$ with certain elements from $H$. Furthermore, the inequalities (2.3), (2.4), and (2.4') remain valid for all $u$ in $H_1$ provided, of course, that for $u$ in $H_1$ we replace $(Su, Ku)$ by $[u, u]_1$ and $K$ and $S$ are also used to denote their extensions to all of $H_1$ in case of (2.3)-(2.4) or (2.3)-(2.4'), respectively. Since (2.1) is K-p.d., the functional $E(u)$ defined by

$$E(u) = \frac{(Tu, Ku)}{(Su, Ku)} \tag{2.6}$$

is positive on $D_T$ and therefore has the infimum (inf.) which we denote by $\lambda_1$. Thus, there exists a minimizing sequence $\{u_i\} \subset D_T$ such that

$$\inf_{u \in D_T} E(u) = \lim_{i} E(u_i) = \lambda_1. \tag{2.7}$$

However, from the existence of such a minimizing sequence of elements we cannot conclude without imposing further conditions that there exists an element $w_1$ in $D_T$ or even in $H_0$ for which $E(w_1)$ is actually equal to $\lambda_1$. Before we discuss this question let us first observe that the comparison of formulas (2.2) and (2.6) shows that the eigenvalues $\lambda$ of (2.1) can be looked for among the values of the functional $E(u)$. In fact, we have the following useful lemma.

Lemma 2.2. The inf. $\lambda_1$ of $E(u)$ is an eigenvalue of (2.1) if and only if there exists an element $w_1 \ne 0$ in $D_T$ such that

$$\frac{(Tw_1, Kw_1)}{(Sw_1, Kw_1)} = \lambda_1.$$

Moreover, if such $w_1$ exists, then $\lambda_1$ is the smallest eigenvalue of (2.1).
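The variational statement of Lemma 2.2 can be illustrated numerically in two real dimensions. The sketch below is an assumption-laden example, not part of the paper: the matrices `T`, `S`, `K` are hypothetical, chosen so that $(Tu, Ku) = 2u_1^2 + 2u_1u_2 + 3u_2^2$ and $(Su, Ku) = u_1^2 + 2u_2^2$, and for this particular pencil the eigenvalues of $Tu = \lambda Su$ are $1$ and $2.5$. Since $E(u)$ does not change when $u$ is rescaled, it suffices to sample $u$ on half of the unit circle.

```python
import math

def matvec(A, x):
    """Matrix-vector product for small dense matrices stored as lists of rows."""
    return [sum(a * xi for a, xi in zip(row, x)) for row in A]

def dot(x, y):
    return sum(a * b for a, b in zip(x, y))

# Hypothetical data (assumption): K^T T and K^T S are symmetric positive definite.
T = [[2.0, 1.0], [-1.0, 2.0]]
S = [[1.0, 0.0], [-1.0, 2.0]]
K = [[1.0, 1.0], [0.0, 1.0]]

def E(u):
    """The functional (2.6): E(u) = (Tu, Ku) / (Su, Ku)."""
    Ku = matvec(K, u)
    return dot(matvec(T, u), Ku) / dot(matvec(S, u), Ku)

# Sample directions u = (cos t, sin t) for t in [0, pi).
N = 1000
values = [E([math.cos(math.pi * k / N), math.sin(math.pi * k / N)])
          for k in range(N)]

lam_1 = min(values)      # approximates inf E(u), the smallest eigenvalue
lam_max = max(values)    # bounded above by the largest eigenvalue
```

The sampled minimum agrees with the smallest eigenvalue of the pencil, attained in the direction of the corresponding eigenvector, and no sampled value falls below it.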

We omit the proof of this lemma for, as will be seen below, it will be a special case of Lemma 2.4. Let us note that Lemma 2.2 allows us to replace the problem of finding the least eigenvalue of (2.1) by the problem of finding an element in $H_0$ realizing the inf. of $E(u)$ provided, of course, that such an element exists. To prove its existence we assume the validity of the following condition: ($\alpha$): Every set of elements in $H_{01} = H_0 \cap H_1$, bounded in the $H_0$-norm, is compact in the $H_1$-norm.

Lemma 2.3. If condition ($\alpha$) is satisfied, then $H_0$ is a subset of $H_1$, in the sense of identifying uniquely the elements from $H_0$ with certain elements from $H_1$. In this sense $D_T \subset H_0 \subset H_1$.

Proof. Let $h_0$ be an arbitrary element in $H_0$. Since $D_T$ is dense
in $H_0$ there exists a sequence $\{u_n\}$ in $D_T$ such that $|u_n - h_0| \to 0$
as $n \to \infty$. Thus, $\{u_n\}$ is a Cauchy sequence in $H_0$ and, in
particular, it is bounded in the $H_0$-norm. Hence, by condition ($\alpha$),
$\{u_n\}$ contains a subsequence $\{u_{n_i}\}$ which is convergent in the
$H_1$-metric. Since $H_1$ is complete there exists an element $h_1$ in $H_1$ such
that $|u_{n_i} - h_1|_1 \to 0$ as $i \to \infty$. Let us note that in view of
Lemma 1.2 and the properties of $H_1$ it follows easily that every subsequence of
$\{u_n\}$, convergent in the $H_1$-metric, converges to the same element $h_1$ in
$H_1$ and that any two equivalent sequences in $H_0$-norm produce equivalent
subsequences convergent in the $H_1$-metric. We can thus assign to each element
$h_0$ of $H_0$ a unique element $h_1$ of $H_1$, the identification obviously being
linear and such that $h_0 = h_1$ for elements $h_0$ in $D_T$. Moreover, this
identification correspondence is also one-to-one for if $h_1 = 0$, then $h_0 = 0$.

To see this note first that, by (2.3), $\|u_{n_i}\| \le \sqrt{\beta_1}\,|u_{n_i}|_1 \to 0$
as $i \to \infty$ and, by (1.6), $\|Ku_{n_i} - Ku_{n_j}\| \le \gamma_2\,|u_{n_i} - u_{n_j}| \to 0$
as $i$ and $j$ increase indefinitely. This and the closeability of $K$ imply that
for any $u$ in $D_T$

    $$[h_0, u] = \lim_n [u_n, u] = \lim_i [u_{n_i}, u] = \lim_i (Tu_{n_i}, Ku) = \lim_i (Ku_{n_i}, Tu) = 0 .$$

Consequently, we must conclude that $h_0 = 0$, as was to be shown.

Let us note in passing that Lemma 2.3 is also valid under the weaker condition

    $$(Tu, Ku) \ge a\,(Su, Ku), \quad a > 0, \quad u \in D_T .$$

However, the stronger condition ($\alpha$) is imposed for it is essential in the
existence proof.

Let us write $E(u)$ in the form

    $$E(u) = \frac{[u, u]}{[u, u]_1} \qquad (2.6')$$

and observe that, in view of Lemma 2.3, the right side in (2.6'), which is valid by
definition for all $u$ in $D_T$, has also meaning for all $u$ in $H_0$. We may,
therefore, use it to extend the functional $E(u)$ to all of $H_0$. Furthermore,
since $D_T$ is dense in $H_0$ with respect to both the $H_0$- and $H_1$-metric, it
is not hard to show that

    $$\inf_{u \in D_T} E(u) = \inf_{u \in H_0} E(u) = \lambda_1 . \qquad (2.7)$$

An element $w \ne 0$ in $H_0$ will be called a generalized eigenelement of (2.1)
belonging to $\lambda$ if it satisfies, for every $u$ in $H_0$, the identity

    $$[w, u] = \lambda\,[w, u]_1 ; \qquad (2.8)$$

the number $\lambda$ satisfying (2.8) will be called a generalized eigenvalue of (2.1).
It is evident that every ordinary eigenelement $w$ belonging to the eigenvalue
$\lambda$ of (2.1) is also a generalized eigenelement of (2.1) but the converse
need not be true.

Lemma 2.4. If there exists an element $w_1 \ne 0$ in $H_0$ such that

    $$E(w_1) = \frac{[w_1, w_1]}{[w_1, w_1]_1} = \lambda_1 , \qquad (2.9)$$

then $w_1$ is a generalized eigenelement of (2.1) belonging to $\lambda_1$ and
$\lambda_1$ is the least generalized eigenvalue of (2.1).

Proof. Suppose there exists $w_1 \ne 0$ in $H_0$ such that (2.9) is valid.
Let us define the bilinear functional

    $$Q(u, v) = [v, u] - \lambda_1 [v, u]_1$$

for all $u$ and $v$ in $H_0$. By (2.7), $Q(u, u) \ge 0$ for all $u$ in $H_0$ with
$Q(w_1, w_1) = 0$. If $\theta$ is any real number, $u$ any element in $H_0$, and
$\tau = \theta\,Q(w_1, u)$, then using the fact that $Q(w_1, w_1) = 0$ we get

    $$0 \le Q(w_1 + \tau u,\, w_1 + \tau u) = \bar\tau\,Q(u, w_1) + \tau\,Q(w_1, u) + |\tau|^2 Q(u, u)$$

which, after substituting the value for $\tau$, reduces to the inequality

    $$\theta\,|Q(w_1, u)|^2 \{ 2 + \theta\,Q(u, u) \} \ge 0 .$$

Since the left member of this inequality changes sign with $\theta$ small unless
$Q(w_1, u) = 0$, we conclude that $Q(w_1, u) = [w_1, u] - \lambda_1 [w_1, u]_1 = 0$
for all $u$ in $H_0$. Clearly $\lambda_1$ is the least generalized eigenvalue of
(2.1). This establishes the validity of Lemma 2.4.

Let us note that if $w_1 \in D_T$ and $u$ is chosen to be any element in $D_T$,
then from the last identity we derive the equality $(Tw_1 - \lambda_1 Sw_1, Ku) = 0$
valid for all $u$ in $D_T$. Since the set $KD_T$ is dense in $H$,
$Tw_1 - \lambda_1 Sw_1 = 0$; i.e., $\lambda_1$ is an eigenvalue of (2.1) and $w_1$
its corresponding eigenelement. That $\lambda_1$ is the least eigenvalue follows
from (2.6) and the definition of $\lambda_1$. Thus, Lemma 2.2 is also proved.
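
The sign-change step in the proof of Lemma 2.4 can be made concrete with a short
worked computation. The sketch below assumes a real Hilbert space, an assumption
introduced here only to keep the algebra short; in that case $Q$ is a symmetric
bilinear form and $\tau$ is real.

```latex
% Assumption (not in the original): real Hilbert space, so Q(u,v) = Q(v,u)
% and \tau = \theta Q(w_1,u) is a real number.
\begin{align*}
0 \le Q(w_1+\tau u,\, w_1+\tau u)
  &= Q(w_1,w_1) + 2\tau\,Q(w_1,u) + \tau^2\,Q(u,u) \\
  &= 2\theta\,Q(w_1,u)^2 + \theta^2\,Q(w_1,u)^2\,Q(u,u)
     && \text{since } Q(w_1,w_1)=0,\ \tau=\theta\,Q(w_1,u) \\
  &= \theta\,Q(w_1,u)^2\,\bigl\{\,2+\theta\,Q(u,u)\,\bigr\}.
\end{align*}
% Choose \theta < 0 with |\theta|\,Q(u,u) < 2: the brace is positive and \theta
% is negative, so the right side is negative unless Q(w_1,u) = 0.
```

Since the inequality must hold for every real $\theta$, the only way out is
$Q(w_1, u) = 0$, which is the identity (2.8) with $w = w_1$ and $\lambda = \lambda_1$.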


Theorem 2.1. (a) If condition ($\alpha$) is satisfied, then the problem (2.1)
possesses a generalized eigenelement $w_1 \ne 0$ in $H_0$ belonging to $\lambda_1$
which is the least generalized eigenvalue of (2.1).

(b) If, in addition, we assume that $S$ is either $H_0$-bounded or is such
that for some $\eta_3 > 0$

    $$\|Su\| \le \eta_3\,|u| , \quad u \in D_T , \qquad (2.10)$$

then $w_1$ is an ordinary eigenelement of (2.1) corresponding to $\lambda_1$ which
is the least eigenvalue of (2.1).

Proof. Let $\{u_n\}$, $u_n \in D_T$, $n = 1, 2, \ldots$, be a minimizing sequence
of elements for $E(u)$ which is normalized in the $H_1$-metric. Then $|u_n|_1 = 1$
and $\lim E(u_n) = \lim |u_n|^2 = \lambda_1$. This implies that the sequence
$\{u_n\}$ is bounded in the $H_0$-norm. Hence, by condition ($\alpha$), it contains
a subsequence which converges in the $H_1$-metric. For simplicity, this subsequence
will also be denoted by $\{u_n\}$. Let us note that, in view of (2.7), for any $u$
in $H_0$ we evidently have $|u|^2 \ge \lambda_1 |u|_1^2$; hence

    $$|u_n - u_m|^2 - \lambda_1 |u_n - u_m|_1^2 \ge 0 \qquad (2.11)$$

and

    $$|u_n + u_m|^2 - \lambda_1 |u_n + u_m|_1^2 \ge 0 . \qquad (2.12)$$

It is not hard to see that the sum of the two left members is

    $$2 \bigl\{ [\,|u_n|^2 - \lambda_1 |u_n|_1^2\,] + [\,|u_m|^2 - \lambda_1 |u_m|_1^2\,] \bigr\}$$

and approaches zero as $n$ and $m$ increase indefinitely. Hence the left members
in (2.11) and (2.12) approach zero as $n$ and $m$ increase indefinitely. In
particular

    $$\bigl\{ |u_n - u_m|^2 - \lambda_1 |u_n - u_m|_1^2 \bigr\} \to 0 , \quad \text{as } n, m \to \infty . \qquad (2.13)$$

Since $H_1$ is complete and $\{u_n\}$ is a convergent sequence in $H_1$ there exists
$w_1 \in H_1$ such that $|u_n - w_1|_1 \to 0$, as $n \to \infty$. Then clearly
$|w_1|_1 = 1$ and $|u_n - u_m|_1 \to 0$ as $n, m \to \infty$. This and (2.13) imply
that $|u_n - u_m| \to 0$ as $n, m \to \infty$. Hence the sequence $\{u_n\}$ converges
also in the $H_0$-metric. In view of Lemma 2.3, the sequence $\{u_n\}$ converges in
$H_0$ to the same element $w_1$ in $H_0$. Thus,

    $$\lambda_1 = E(w_1) = \frac{[w_1, w_1]}{[w_1, w_1]_1} = \lim E(u_n) , \qquad (2.14)$$

i.e., $w_1$ solves the minimum problem. Lemma 2.4 implies that $w_1$ is a
generalized eigenelement of (2.1) corresponding to $\lambda_1$ which is clearly the
least generalized eigenvalue of (2.1).

Theorem 2.1 offers an extension of the results obtained by Hilbert
and Courant [5] and Friedrichs [8] for selfadjoint (s.a.) differential eigenvalue
problems and of Mikhlin [25] for abstract s.a. and positive definite eigenvalue
problems in $H$. The proof of Theorem 2.1(a) follows essentially the
arguments of the above authors.
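
The passage from (2.11) and (2.12) to (2.13) in the proof of Theorem 2.1(a) rests
on the parallelogram law, applied once in the $H_0$-norm and once in the
$H_1$-norm. A minimal worked version of that step, using only the quantities
already defined in the proof, might read:

```latex
% Parallelogram law in H_0 and in H_1 (both norms come from inner products):
\begin{align*}
|u_n-u_m|^2 + |u_n+u_m|^2     &= 2|u_n|^2 + 2|u_m|^2, \\
|u_n-u_m|_1^2 + |u_n+u_m|_1^2 &= 2|u_n|_1^2 + 2|u_m|_1^2 .
\end{align*}
% Subtract \lambda_1 times the second line from the first:
\begin{align*}
\bigl(|u_n-u_m|^2-\lambda_1|u_n-u_m|_1^2\bigr)
 &+ \bigl(|u_n+u_m|^2-\lambda_1|u_n+u_m|_1^2\bigr) \\
 &= 2\bigl\{(|u_n|^2-\lambda_1|u_n|_1^2) + (|u_m|^2-\lambda_1|u_m|_1^2)\bigr\}.
\end{align*}
% With |u_n|_1 = 1 each bracket on the right equals E(u_n) - \lambda_1 \to 0.
% Both summands on the left are nonnegative by (2.11) and (2.12),
% hence each of them tends to zero separately.
```

Once the first summand tends to zero, convergence of $\{u_n\}$ in the $H_1$-norm
forces convergence in the $H_0$-norm as well, which is exactly how (2.13) is used.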

To prove Theorem 2.1(b) it is sufficient, in view of Lemma 2.2, to show
that $w_1 \in D_T$. To do this let us first observe that since, by Theorem 2.1(a),
$w_1$ is a generalized eigenelement we have for all $u$ in $H_0$ and in particular
for all $u$ in $D_T$ the identity

    $$[w_1, u] = \lambda_1 [w_1, u]_1 , \quad u \in D_T , \qquad (2.15)$$

which, on account of the first hypothesis on $S$ and Lemma 2.3, can be written
in the form

    $$[w_1, u] = \lambda_1 (Sw_1, Ku) , \quad u \in D_T . \qquad (2.16)$$

Let us recall that by Theorem 1.1 the domain $D_T$ of $T$ consists of all elements
in $H_0$ realizing the minimum of the functional $F(u) = [u, u] - (Ku, f) - (f, Ku)$
where $f$ ranges through all of $H$. Therefore, if in particular we put
$f = \lambda_1 Sw_1$ and use the relation (2.16) we obtain in this case for the
value of $F(u)$ the quantity

    $$F(u) = [u, u] - [u, w_1] - [w_1, u] = |u - w_1|^2 - |w_1|^2 . \qquad (2.17)$$

The relation (2.17) shows that $F(u)$ attains its minimum at $u = w_1$. This shows
that $w_1 \in D_T$ and therefore, by Lemma 2.2, $w_1$ is an ordinary eigenelement
belonging to the smallest eigenvalue $\lambda_1$ of (2.1).
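
The identity (2.17) is a completion of the square in the $H_0$ inner product. As a
sketch, with $f = \lambda_1 Sw_1$ as in the proof and with (2.16) taken as given,
the computation runs as follows:

```latex
% From (2.16):  (f, Ku) = \lambda_1 (S w_1, Ku) = [w_1, u],
% and taking complex conjugates:  (Ku, f) = \overline{[w_1,u]} = [u, w_1].
\begin{align*}
F(u) &= [u,u] - (Ku,f) - (f,Ku) \\
     &= [u,u] - [u,w_1] - [w_1,u] \\
     &= \bigl([u,u] - [u,w_1] - [w_1,u] + [w_1,w_1]\bigr) - [w_1,w_1] \\
     &= |u-w_1|^2 - |w_1|^2 .
\end{align*}
% The term |w_1|^2 does not depend on u, so F is smallest exactly when
% |u - w_1| = 0, that is, at u = w_1.
```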

If instead of the first we use the second condition (2.10) on $S$, then as
was shown in Chapter 1 the operator $S$ can be extended by closure to $\bar S$
defined on all of $H_0$ and hence, to prove Theorem 2.1(b), all we need is to
replace in (2.16) the operator $S$ by $\bar S$. This completes the proof of
Theorem 2.1.

To determine the succeeding generalized or ordinary eigenvalues of (2.1),
Lemma 2.1(b), which is also valid for the generalized eigenelements, suggests
looking for the second generalized or ordinary eigenelement $w_2$ of (2.1) among
the elements of the set $H_0^1 = \{ u;\ u \in H_0,\ [w_1, u]_1 = 0 \}$. It follows
from (2.8) that the condition $[w_1, u]_1 = 0$ is equivalent to the condition
$[w_1, u] = 0$. Furthermore, if such an element $w_2$ exists, then

    $$\lambda_2 = \inf_{u \in H_0^1} E(u) = E(w_2) = \frac{[w_2, w_2]}{[w_2, w_2]_1} .$$

In general the generalized eigenelements $w_{n+1}$ belonging to $\lambda_{n+1}$, if
they exist, are such that

    $$\lambda_{n+1} = E(w_{n+1}) = \inf_{u \in H_0^n} E(u) , \qquad (2.18)$$

where $H_0^n = \{ u;\ u \in H_0,\ [w_i, u]_1 = 0,\ i = 1, 2, \ldots, n \}$ provided,
of course, that the existence of the generalized eigenelements
$w_1, w_2, \ldots, w_n$ belonging to the increasing set of generalized eigenvalues
$\lambda_1, \lambda_2, \ldots, \lambda_n$ has been already established.

Theorem 2.2. If (2.1) is K-p.d. and condition ($\alpha$) is satisfied, then
(2.1) has countably many generalized eigenvalues
$\lambda_1, \lambda_2, \ldots, \lambda_n, \ldots$, determined by the variational
principle (2.18), such that $\lambda_n \to \infty$ as $n \to \infty$; the
corresponding generalized eigenelements $w_1, w_2, \ldots, w_n, \ldots$ form a
complete set in each of the spaces $H_0$, $H_1$, and $H$.

Proof. We proceed to establish the existence of the generalized eigenelement
$w_{n+1}$ belonging to $\lambda_{n+1} = \inf_{u \in H_0^n} E(u)$ assuming that the
existence of the generalized eigenelements $w_1, w_2, \ldots, w_n$ belonging to the
corresponding increasing set of the first $n$ generalized eigenvalues
$\lambda_1, \lambda_2, \ldots, \lambda_n$ is already established. Theorem 2.1(a)
justifies such an assumption.

Let $\{u_i\}$, $u_i \in D^n \equiv D_T \cap H_0^n$, $i = 1, 2, 3, \ldots$, be a
minimizing sequence of elements for $E(u)$ on $H_0^n$ which is normalized in the
$H_1$-metric. Repeating the same argument as in the proof of the previous theorem
we come to the conclusion that there exists $w_{n+1} \in H_0$ such that

    $$|u_i - w_{n+1}|_1 \to 0 , \quad |u_i - w_{n+1}| \to 0 , \quad \text{as } i \to \infty . \qquad (2.19)$$

Evidently, $|u_i|_1 \to |w_{n+1}|_1$ and $E(u_i) = |u_i|^2 \to |w_{n+1}|^2$ as
$i \to \infty$. Hence, $|w_{n+1}|_1 = 1$ and $E(w_{n+1}) = \lambda_{n+1}$.
Furthermore, since $u_i \in D^n$, it follows from (2.19) that $w_{n+1} \in H_0^n$.
In fact, by (2.19), $[w_{n+1}, w_j]_1 = \lim [u_i, w_j]_1 = 0$ and
$[w_{n+1}, w_j] = \lim [u_i, w_j] = \lambda_j \lim [u_i, w_j]_1 = 0$,
$j = 1, 2, \ldots, n$. Thus $w_{n+1}$ solves the minimum problem. To prove that
$w_{n+1}$ is a generalized eigenelement of (2.1) belonging to $\lambda_{n+1}$ we
have to show that

    $$[w_{n+1}, u] = \lambda_{n+1} [w_{n+1}, u]_1 \qquad (2.20)$$

for all $u$ in $H_0$. Using the same arguments as in the proof of Lemma 2.4 and
the property of $\lambda_{n+1}$ we easily derive the validity of (2.20) for all
elements $v$ in $H_0^n$. Let $u$ be an arbitrary element in $H_0$ and
$w = u - \sum_{j=1}^{n} [u, w_j]_1 w_j$. It is obvious that $w \in H_0^n$ and for
this $w$ the relation (2.20) is valid. Since

    $$[w, w_{n+1}] = [u, w_{n+1}] - \sum_{j=1}^{n} [u, w_j]_1 [w_j, w_{n+1}] = [u, w_{n+1}]$$

and

    $$[w, w_{n+1}]_1 = [u, w_{n+1}]_1 - \sum_{j=1}^{n} [u, w_j]_1 [w_j, w_{n+1}]_1 = [u, w_{n+1}]_1 ,$$

it follows that the identity (2.20) is valid for all $u$ in $H_0$. Hence $w_{n+1}$
is a generalized eigenelement of (2.1) belonging to the generalized eigenvalue
$\lambda_{n+1}$ which is clearly the smallest generalized eigenvalue following
$\lambda_n$. It is obvious that, since the set $H_0 \subset H_1$ is separable and
infinite dimensional and $\{w_n\}$ is an orthonormal set in $H_{01}$, the set of
corresponding generalized eigenvalues $\{\lambda_n\}$ is countable and has infinity
as its only limit point. In fact, if $\lambda_n \to \lambda_0 < \infty$, then
$|w_n|^2 = \lambda_n |w_n|_1^2 = \lambda_n \le \lambda_0$, where $\{w_n\}$ is the
$H_1$-orthonormal sequence of generalized eigenelements belonging to
$\{\lambda_n\}$. Thus, $\{w_n\}$ is bounded in the $H_0$-norm and therefore, by
condition ($\alpha$), is compact in $H_1$. But this is impossible since $\{w_n\}$
forms an orthonormal sequence in $H_1$.
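
The contradiction at the end of the preceding argument can be spelled out in one
line: an orthonormal sequence has no Cauchy subsequence. A short sketch, using only
the $H_1$-orthonormality of $\{w_n\}$:

```latex
% H_1-orthonormality:  [w_n, w_m]_1 = \delta_{nm}.
\begin{align*}
|w_n - w_m|_1^2
  &= [w_n,w_n]_1 - [w_n,w_m]_1 - [w_m,w_n]_1 + [w_m,w_m]_1 \\
  &= 1 - 0 - 0 + 1 = 2 , \qquad n \ne m .
\end{align*}
% Any two distinct terms are at H_1-distance \sqrt{2}, so no subsequence of
% \{w_n\} converges in H_1, which is incompatible with compactness in H_1.
```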

To prove the second part of Theorem 2.2 note that, in view of Lemma 2.3,
it is sufficient to show that every element in $H_{01}$ can be approximated
arbitrarily closely (in the sense of the norms in $H_0$, $H_1$, and $H$) by some
linear combination (possibly different in each case) of $\{w_n\}$.

Let $u$ be any element in $H_{01}$ and $u_n = \sum_{i=1}^{n} [u, w_i]_1 w_i$. If
we put $v_n = u - u_n$, then $[v_n, w_i]_1 = 0$ and $[v_n, w_i] = 0$ for
$i = 1, 2, \ldots, n$. Hence $v_n \in H_0^n$ and therefore, by (2.18),

    $$\lambda_{n+1} \le \frac{|v_n|^2}{|v_n|_1^2} , \quad \text{i.e.,} \quad |v_n|_1^2 \le \frac{|v_n|^2}{\lambda_{n+1}} . \qquad (2.22)$$

Since $|v_n|^2 = |u - u_n|^2 = |u|^2 - |u_n|^2 \le |u|^2$ and
$\lambda_{n+1} \to \infty$, as $n \to \infty$, (2.22) yields

    $$|u - u_n|_1 \to 0 , \quad \text{as } n \to \infty . \qquad (2.23)$$

This and (2.3) also show that

    $$\|u - u_n\| \to 0 , \quad \text{as } n \to \infty . \qquad (2.24)$$

Furthermore, if $m > n$, then the inequality
$0 \le |u_m - u_n|^2 = |u_m|^2 - |u_n|^2 \le |u|^2$ shows that the sequence
$\{|u_n|^2\}$ is monotonically increasing and bounded. Hence $|u_m - u_n| \to 0$,
as $n, m \to \infty$. Since $H_0$ is complete, Lemma 2.3 and the relations (2.23)
and (2.24) imply that

    $$|u - u_n| \to 0 , \quad \text{as } n \to \infty . \qquad (2.25)$$

Thus, from (2.23), (2.24), and (2.25) we see that the set of linear combinations
of $\{w_i\}$ is dense in $H_{01}$, in the sense of all three norms. This completes
the proof of Theorem 2.2.
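
The equality $|u - u_n|^2 = |u|^2 - |u_n|^2$ used in the completeness argument is a
Bessel-type identity. The sketch below assumes that $[\cdot,\cdot]$ is linear in its
first argument and conjugate-linear in its second (a convention chosen here for
definiteness), and writes $c_i = [u, w_i]_1$ as a shorthand that does not appear in
the original.

```latex
% Facts used: [w_i,w_j]_1 = \delta_{ij}, and by (2.8)
% [w_i,w_j] = \lambda_i \delta_{ij}, \quad [u,w_i] = \lambda_i [u,w_i]_1 = \lambda_i c_i .
\begin{align*}
|u_n|^2 &= \Bigl[\sum_{i=1}^{n} c_i w_i,\ \sum_{j=1}^{n} c_j w_j\Bigr]
         = \sum_{i=1}^{n} \lambda_i |c_i|^2 , \\
[u,u_n] &= \sum_{i=1}^{n} \overline{c_i}\,[u,w_i]
         = \sum_{i=1}^{n} \lambda_i |c_i|^2 = |u_n|^2 , \\
|u-u_n|^2 &= |u|^2 - [u,u_n] - [u_n,u] + |u_n|^2
           = |u|^2 - |u_n|^2 .
\end{align*}
% The same computation with u replaced by u_m, m > n, gives
% |u_m - u_n|^2 = |u_m|^2 - |u_n|^2 .
```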

Theorem 2.3. If in addition to the hypothesis assumed in Theorem 2.2
we also assume that $S$ satisfies the conditions of Theorem 2.1(b), then the
problem (2.1) has countably many eigenvalues $\lambda_n$ such that
$\lambda_n \to \infty$, as $n \to \infty$, and the corresponding eigenelements
$w_n$ form a complete set in each of the spaces $H_0$, $H_1$ and $H$.

Proof. Theorem 2.3 follows from Theorem 2.2 and the fact that under
the additional conditions on $S$ all elements $w_i$ belong to $D_T$. The proof of
the latter assertion is exactly the same as the proof of Theorem 2.1(b). Lemma 2.2
then completes the proof of Theorem 2.3.

Before we go on in our discussion let us remark that in view of the
importance of condition ($\alpha$) it is interesting to know whether it can be
replaced by any other condition. To that end we prove

Lemma 2.5. Let $S$ satisfy conditions (2.3) and (2.4). If $S$ is also
$H_0$-bounded or is closeable and such that

    $$\|Su\| \le \eta\,|u|_1 , \quad u \in D_T , \qquad (2.26)$$

then condition ($\alpha$) is satisfied if and only if $T^{-1}S_0$ is compact in
$H_0$, where $S_0$ is either $S$ or $\bar S$.

Proof. Assume that condition ($\alpha$) is satisfied. Let $Q_0$ be an
arbitrary bounded set in $H_0$; i.e., there exists an $M_0 > 0$ such that

    $$|u| \le M_0 , \quad u \in Q_0 . \qquad (2.27)$$

By condition ($\alpha$), $Q_0$ is compact in the $H_1$-metric. Hence we can extract
from it a sequence $\{u_n\}$ such that

    $$|u_m - u_n|_1 \to 0 , \quad \text{as } m, n \to \infty . \qquad (2.28)$$

Using (1.4) and Lemma 1.1(c) we get

    $$|T^{-1}S_0(u_m - u_n)|^2 = (S_0(u_m - u_n),\, KT^{-1}S_0(u_m - u_n))$$
    $$\le |u_m - u_n|_1\, (S_0T^{-1}S_0(u_m - u_n),\, KT^{-1}S_0(u_m - u_n))^{1/2} . \qquad (2.29)$$


In  view  of  Lemma  1.  5,   Lemma  1.  4  or  (2.  26),   the  inequality  (2.  27),   and  (2.  28) 
we  derive  from  (2.  29)  the  relation 

It^^s  (u    -u  )]^   <    4m\]|s  t"^]|  •IIkt'MI  .|u     -ulf  «  0 

'  omn'—  o  5"    o         "     "  "     '    m         n  ^  1 

as $m, n \to \infty$. Hence the operator $T^{-1}S_0$ is compact in $H_0$.

(11) This part of Lemma 2.5 remains also valid if $S$ satisfies the weaker
conditions specified in Theorem 2.1(b).

To prove the converse assume that $T^{-1}S_0$ is compact in $H_0$ and $Q_{01}$
is an arbitrary set of elements in $H_{01}$ which is bounded in the $H_0$-norm,
i.e., there exists $M_{01} > 0$ such that

    $$|u| \le M_{01} , \quad u \in Q_{01} . \qquad (2.30)$$

Since $T^{-1}S_0$ is compact in $H_0$ we can extract a sequence $\{u_n\}$ from
$Q_{01}$ such that

    $$|T^{-1}S_0(u_m - u_n)| \to 0 , \quad \text{as } m, n \to \infty . \qquad (2.31)$$

Let us also observe that our conditions on $S$ imply that the formula

    $$[u, v]_1 = (S_0u, Kv) \qquad (2.32)$$

is valid for all $u, v \in H_1$. In fact, it is valid by definition for all
$u, v \in D_T$ and by the continuity of $S_0$, as an operator from $H_1$ to $H$,
it remains valid for all $u, v \in H_1$. Therefore, by (2.30) and (2.31),

    $$|u_m - u_n|_1^2 = (S_0(u_m - u_n),\, K(u_m - u_n)) = [T^{-1}S_0(u_m - u_n),\, u_m - u_n]$$
    $$\le 2M_{01}\, |T^{-1}S_0(u_m - u_n)| \to 0 , \quad \text{as } m, n \to \infty .$$

Hence, $\{u_n\}$ is a convergent sequence in the $H_1$-metric; i.e., $Q_{01}$ is
compact in the $H_1$-metric and, consequently, condition ($\alpha$) is satisfied.

Let us note that if $D_K \supseteq D_S$ and the inequalities (2.3) and (2.4) are
valid for all $u \in D_S$, then $S$ is K-p.d. and the completion of $D_S$ in the
metric (2.5) is a Hilbert space $H_1 \subseteq H$. Using Theorem 1.1 we can extend
$S$ to the

K-p.  d.   So  g.   Friedrichs  extension.     Let  us  denote  this  extension  by    S.      In  this 

case  we  can  also  approach  the  problem  of  this  section    in  the  following  way. 

Since $S^{-1}$ exists and is defined on all of $H$ we can apply it to equation
(2.1) and obtain an equivalent equation

    $$S^{-1}Tu - \lambda u = 0 , \quad u \in D_T \subset H_1 , \qquad (2.1a)$$

in which $A = S^{-1}T$, considered as an operator in $H_1$, is symmetric and
positive. In fact, $[Au, v]_1 = (Tu, Kv) = (Ku, Tv) = (Ku, SAv) = [u, Av]_1$ for
all $u$ and $v$ in $D_T$. Thus, we can introduce the space $H_A$ as the completion
of $D_T$ in the metric $[u, v]_A = [Au, v]_1$. Since
$[u, v]_A = [Au, v]_1 = (Tu, Kv)$, we see that $H_A = H_0$. The positive operator
$A$ can be extended to a selfadjoint operator in $H_1$. Let $A$ also denote this
extension and let us regard the eigenvalues and eigenelements of (2.1) to be the
eigenvalues and the corresponding eigenelements of the extended operator $A$.
Since $H_A = H_0$, our condition ($\alpha$) is unchanged; i.e., every set bounded
in the $H_A$-metric is compact in the $H_1$-metric. Thus we can extend to the
eigenvalue problem (2.1a) with extended $A$ all the results obtained in [25] for
the selfadjoint and positive definite eigenvalue problem.

Let us note that the inconvenience of such an approach to the problem
(2.1), even in case $S$ is K-p.d., is twofold: we do not know, in general, under
what conditions the eigenvectors of $A$ satisfy the equation (2.1) in the
usual sense, and furthermore it requires first the extension of $S$ and then the
computation of its inverse $S^{-1}$, which generally presents a practical
difficulty.

2.3. K-real eigenvalue problem. In this section we consider the problem
of the existence of eigenvalues and eigenelements of (2.1) under conditions
different from those assumed in the previous section. Our basic assumption
here is the condition

($\beta$): The operator $G(\lambda) = T - \lambda S$ is continuously invertible
for all complex numbers $\lambda$ except possibly for the eigenvalues $\lambda$ of
(2.1).

Let us observe in passing that, for example, condition ($\beta$) will hold if
the spectrum $\sigma(2.1)$ of (2.1), i.e., the set of all $\lambda$ for which
$G(\lambda)$ is not densely invertible, contains only eigenvalues and $S$ is a
bounded operator in $H$. Indeed, if this is the case, then for any $\lambda$ in
the resolvent set $\rho(2.1)$ of (2.1) the range space $R_G$ of $G(\lambda)$ is
dense in $H$ and the inverse $G^{-1}$ exists and is bounded on $R_G$.
Consequently, to obtain condition ($\beta$) it is thus sufficient to show that
$G(\lambda)$ is closed for each $\lambda \in \rho(2.1)$. Let $\{u_n\}$ be a
sequence in $D_G = D_T$ and $f$ in $H$ such that $\|u_n - u\| \to 0$ and
$\|Gu_n - f\| \to 0$, as $n \to \infty$. Then, since $S$ is bounded and
$u_n \to u$, $Tu_n = Gu_n + \lambda Su_n \to f + \lambda Su$, as $n \to \infty$.
Hence $u \in D_T = D_G$ and $Tu = f + \lambda Su$; i.e., $Gu = f$, $G$ is closed,
and $R_G = H$.

It will be seen below that condition ($\beta$) allows us to reduce the
investigation of the eigenvalue problem (2.1) to the class of symmetric eigenvalue
problems investigated by Heuser [12] and Kharazov [18]. (12) To that end we
consider the space $D[T]$ introduced in § 1.1. Although $D[T]$ is, in general, not
complete it satisfies all the conditions imposed on the space considered in
[12, 18]. Let us also define the operator $N = T^{-1}S$ which maps $D[T]$ into
itself. It is not hard to see that the definition (1.4) of the metric in $D[T]$
and the K-symmetry of $S$ imply that $N$ is a symmetric operator in $D[T]$. In
fact,

    $$[Nu, v] = (Su, Kv) = (Ku, TNv) = [u, Nv] , \quad u, v \in D[T] .$$
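
The symmetry of $N$ compresses several steps into one line. An expanded sketch,
assuming as the displayed chain itself does that both $S$ and $T$ are K-symmetric
(that is, $(Sx, Ky) = (Kx, Sy)$ and $(Tx, Ky) = (Kx, Ty)$ on $D_T$), might read:

```latex
% Notation from the text: [x,y] = (Tx, Ky) by (1.4), and N = T^{-1}S, so TN = S.
\begin{align*}
[Nu, v] &= (TNu, Kv)          && \text{definition (1.4) of the metric} \\
        &= (Su, Kv)           && TN = S \\
        &= (Ku, Sv)           && \text{K-symmetry of } S \\
        &= (Ku, TNv)          && S = TN \\
        &= (Tu, KNv)          && \text{K-symmetry of } T \\
        &= [u, Nv]            && \text{definition (1.4) again.}
\end{align*}
```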

By means of the operator $N$ the problem (2.1) can be written in $D[T]$ in the
form

    $$u - \lambda Nu = 0 \qquad (2.1a)$$

or

    $$Nu - \mu u = 0 , \quad \mu = 1/\lambda , \quad \lambda \ne 0 . \qquad (2.1b)$$

(12) Both papers are essentially an extension of the results obtained by
Wieland [36]. In his study Kharazov used the arguments applied by Kamke [14]
to the differential and selfadjoint eigenvalue problem (2.1). The distinct and
interesting feature of this argument is that the existence proof does not involve
the use of the variational principle for the functional $E(u)$.

It is obvious that if $\lambda \ne 0$ is an eigenvalue of (2.1) with multiplicity
$m$ and $\varphi_1, \ldots, \varphi_m$ are linearly independent eigenelements
belonging to $\lambda$ then, since $|u| = 0$ if and only if $\|u\| = 0$,
$\varphi_1, \ldots, \varphi_m$ are also linearly independent eigenelements in
$D[T]$ belonging to the eigenvalue $\mu = 1/\lambda$ of $N$. The converse is
also true provided that the eigenelements belong to $D[T]$. Furthermore, from
the relation

    $$N - \mu = T^{-1}(S - \mu T) , \quad \mu = 1/\lambda , \qquad (2.34)$$

we conclude that $\lambda \ne 0$ belongs to $p\sigma(2.1)$ if and only if
$\mu \ne 0$ belongs to $p\sigma(N)$. The relation (2.34) and condition ($\beta$)
also imply that if $\mu \ne 0$ is not an eigenvalue of $N$ then the operator
$N - \mu$ maps $D[T]$ onto itself; i.e., the operator $N$ satisfies the property
(E) imposed by Heuser [12]. Also, the quantity $[Nu, u] = (Su, Ku)$ cannot be equal
to zero for all $u$ in $D[T]$ unless $Su = 0$ for all $u$ in $D_T$, for otherwise
from the identity

    $$(S(u+v), K(u+v)) - (S(u-v), K(u-v)) + i(S(u+iv), K(u+iv)) - i(S(u-iv), K(u-iv))$$
    $$= 4(Su, Kv) , \quad u, v \in D_T ,$$

we would have to conclude that $(Su, Kv) = 0$ for all $u$ and $v$ in $D_T$. Since
$KD_T$ is dense in $H$, this would imply that $Su = 0$ for all $u$ in $D_T$.
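
The four-term identity above is the polarization identity for the form
$B(u, v) = (Su, Kv)$. The verification below is a sketch; the shorthand $B$ is
introduced here for brevity, and the inner product of $H$ is assumed linear in its
first argument and conjugate-linear in its second.

```latex
% Shorthand (not in the original): B(u,v) = (Su, Kv), sesquilinear,
% linear in u and conjugate-linear in v.
\begin{align*}
B(u+v,u+v) - B(u-v,u-v) &= 2B(u,v) + 2B(v,u), \\
B(u+iv,u+iv) - B(u-iv,u-iv) &= 2B(u,iv) + 2B(iv,u)
                             = -2i\,B(u,v) + 2i\,B(v,u), \\
i\bigl[B(u+iv,u+iv) - B(u-iv,u-iv)\bigr] &= 2B(u,v) - 2B(v,u).
\end{align*}
% Adding the first and third lines, the terms in B(v,u) cancel:
%   total = 4 B(u,v) = 4 (Su, Kv).
```

In particular, if $(Sx, Kx) = 0$ for every $x$, then every term on the left
vanishes and so does $(Su, Kv)$ for all $u$ and $v$.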

Thus, the operator $N$ in $D[T]$ satisfies all the conditions imposed by
Heuser. Consider also the space $H_0 \supseteq D[T]$ and let $\bar N$ denote the
closure of $N$ in $D[T]$ with $D[T] \subset D_{\bar N} \subset H_0$. Then using
the results of [12] it is now not hard to derive the following two theorems.

Theorem 2.4. (a) The operator $\bar N$ is the unique selfadjoint extension
of $N$.

(b) The spectrum $\sigma(N)$ of $N$ is a nonempty closed set of real numbers
such that $\sigma(N) = \sigma(\bar N)$,
$p\sigma(N) - \{0\} = p\sigma(\bar N) - \{0\}$, $c\sigma(N) = c\sigma(\bar N)$,
and $r\sigma(N) - \{0\} = r\sigma(\bar N) = \emptyset$, where $c\sigma(N)$ and
$r\sigma(N)$ denote, respectively, the continuous and the residual spectrum of
$N$ (13) and $\emptyset$ is the empty set.

(c) The set $c\sigma(N) - \{0\}$ consists exactly of all nonzero limit points of
the eigenvalues of $N$ which are not eigenvalues themselves.


(13) Let us recall that $\mu \in c\sigma(N)$ if and only if $R_{N-\mu}$ is dense
in $H_0$ and $(N - \mu)^{-1}$ is unbounded on $R_{N-\mu}$, while
$\mu \in r\sigma(N)$ if and only if $R_{N-\mu}$ is not dense in $H_0$.

Theorem 2.5. The operator $N$ has at least one nonzero eigenvalue.

Corollary 2.1. The problem (2.1) has at least one nonzero eigenvalue.

We  see  from  Theorem  2.  4  and  Theorem  2.  5  that  the  passage  from  the 
eigenvalue  problem  (2.  1)  in    H    to  the  eigenvalue  problem  (2.  lb)  in    D[T]  d  H 
changes  in  general  the  nature  of  the  spectrum    a  (2.  1)    of  (2. 1)  from  the  pure 
point  spectrum  of  (2.  1)  to  the  mixed  spectrum  of    N.       Furthermore,    it  was 
shown  in  [12]  that  the  set  of  eigenelements  in    D[T]    belonging  to  the  correspond- 
ing nonzero  eigenvalues  of    N    or  (2.  1)  is  not,   in  general,   large  enough  to  per- 
mit the  expansion  of  each  element  of  the  form    Nu  =  T     Su    into  a  Fourier  series 
with  respect  to  these  eigenelements. 

However, if in addition to condition ($\beta$) it is known that

($\gamma$) the spectrum $\sigma(N)$ of $N$, considered as an operator in $D[T]$, contains only eigenvalues of finite multiplicity with zero as its sole possible limit point,

then from the results obtained in [18] for the operator $N$ in $D[T]$ and from the above discussion it is not hard to derive for the problem (2.1) the following additional theorem.

Theorem 2.6. (a) The set of eigenvalues $|\lambda_1| \le |\lambda_2| \le \dots \le |\lambda_n| \le \dots$ with the corresponding set of eigenelements $w_1, w_2, \dots, w_n, \dots$, normed in the sense of the metric (1.4), possesses the following extremal property: On the set of elements $u \in D[T]$ satisfying the condition $[u, w_i] = 0$ for $i = 1, 2, \dots, n-1$, the absolute value of the functional

$$\mu(u) = \frac{(Su, Ku)}{(Tu, Ku)} \tag{2.35}$$

attains its maximum, equal to $|\mu_n| = 1/|\lambda_n|$, $n = 1, 2, 3, \dots$, at $u = w_n$. Furthermore, the eigenvalues found by this variational method exhaust entirely the set $p\sigma(2.1)$.

(b) For any $u$ in $D[T]$ we have the representation (generalization of the Hilbert-Schmidt theorem)

$$T^{-1}Su = \sum_{i=1}^{\infty} \frac{(Tu, Kw_i)}{\lambda_i}\, w_i = \sum_{i=1}^{\infty} (Su, Kw_i)\, w_i \tag{2.36}$$

which converges in the $H_0$- and $H$-metric to $T^{-1}Su$.

(c) For $\lambda \notin p\sigma(2.1)$ the unique solution $u$ in $D_T$ of the equation $Tu - \lambda Su = f$, for any $f$ in $H$, is representable in the form

$$u = T^{-1}f + \lambda \sum_{i=1}^{\infty} \frac{1}{\lambda_i - \lambda}\,(f, Kw_i)\, w_i \tag{2.37}$$

converging in the $H_0$-metric.
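The representation (2.37) can be checked numerically in a finite-dimensional setting, where completeness of the eigenelements is automatic. The following is only a sketch under assumptions that are not part of the text: the matrices `A`, `B`, `K` are hypothetical, and $T = K^{-T}A$, $S = K^{-T}B$ are chosen so that $(Tu, Ku) = u^{T}Au$ and $(Su, Ku) = u^{T}Bu$ with $A$, $B$ symmetric positive definite.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 5

# Hypothetical model (an assumption, not taken from the text):
# A, B symmetric positive definite, K invertible and nonsymmetric,
# T = K^{-T} A and S = K^{-T} B, so (Tu, Ku) = u.A.u and (Su, Ku) = u.B.u.
A = 2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)
B = np.diag(np.arange(1.0, n + 1.0))
K = np.eye(n) + np.triu(0.3 * rng.standard_normal((n, n)), k=1)
T = np.linalg.solve(K.T, A)
S = np.linalg.solve(K.T, B)

# Eigenpairs of T w = lam S w (equivalently A w = lam B w) via B = L L^T.
L = np.linalg.cholesky(B)
Linv = np.linalg.inv(L)
lam_i, Q = np.linalg.eigh(Linv @ A @ Linv.T)
# Columns w_i, scaled so that [w_i, w_j] = (T w_i, K w_j) = delta_ij.
W = Linv.T @ Q / np.sqrt(lam_i)

f = rng.standard_normal(n)
lam = 0.5 * (lam_i[0] + lam_i[1])   # a value that is not an eigenvalue

# Expansion (2.37): u = T^{-1} f + lam * sum_i (f, K w_i) / (lam_i - lam) * w_i
coeff = (K @ W).T @ f               # coeff[i] = (f, K w_i)
u = np.linalg.solve(T, f) + lam * (W @ (coeff / (lam_i - lam)))

residual = np.linalg.norm(T @ u - lam * (S @ u) - f)
gram = (K @ W).T @ (T @ W)          # Gram matrix of the w_i in the [.,.] metric
print(residual)
```

In this model the printed residual of $Tu - \lambda Su = f$ should be at rounding level, and `gram` should be the identity matrix, reflecting the normalization $[w_i, w_j] = \delta_{ij}$.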

Let us observe that, using the properties of the $H_0$-bounded operators established in Chapter 1, we can prove a stronger assertion than that contained in Theorem 2.6(b) and, at the same time, derive the necessary and sufficient condition for the set of eigenelements, determined by Theorem 2.6(a), to be complete both in $H_0$ and in $H$.

Lemma 2.6. If $S$ is $H_0$-bounded, then for every $u \in D[T]$ the series

$$Su = \sum_{i=1}^{\infty} (Tu, Kw_i)\, Sw_i = \sum_{i=1}^{\infty} [u, w_i]\, Sw_i \tag{2.38}$$

converges in the $H$-metric to $Su$.

Proof. Since $[w_i, w_j] = \delta_{ij}$ we have for any $u \in D[T]$ the Bessel inequality $\sum_{i=1}^{n} |[u, w_i]|^2 \le |u|^2$ valid for each $n$. This and the completeness of $H_0$ imply that there exists an element $v$ in $H_0$ such that

$$v = \sum_{i=1}^{\infty} [u, w_i]\, w_i. \tag{2.39}$$

Since $S$ is $H_0$-bounded, Lemma 1.4 and (2.39) imply that

$$\Big\| Sv - \sum_{i=1}^{n} [u, w_i]\, Sw_i \Big\| \le \eta\, \Big| v - \sum_{i=1}^{n} [u, w_i]\, w_i \Big| \to 0 \quad \text{as } n \to \infty;$$

i.e., the series

$$Sv = \sum_{i=1}^{\infty} [u, w_i]\, Sw_i \tag{2.40}$$

converges to $Sv$ in $H$. Using the fact that $T^{-1}$ is bounded in $H$ and $T^{-1}Sw_i = \lambda_i^{-1} w_i$ we derive from (2.40) the series

$$T^{-1}Sv = \sum_{i=1}^{\infty} \frac{[u, w_i]}{\lambda_i}\, w_i. \tag{2.41}$$

Taking into account the proved formula (2.36) we get

$$T^{-1}(Sv - Su) = \sum_{i=1}^{\infty} \frac{[u, w_i]}{\lambda_i}\, w_i - T^{-1}Su = 0,$$

whence, in view of (2.40), we obtain the expansion (2.38).

Theorem 2.7. If $S$ is $H_0$-bounded, then the set of eigenelements $\{w_i\}$ belonging to the eigenvalues $\{\lambda_i\}$ determined by Theorem 2.6(a) is complete in $H_0$ if and only if $Su \ne 0$ whenever $u \in H_0$ and $u \ne 0$.

Proof. Necessity. We have to show that the completeness of $\{w_i\}$ in $H_0$ implies that $Su \ne 0$ whenever $u \in H_0$ and $u \ne 0$. Assume the contrary; i.e., $\{w_i\}$ is complete and $Su^* = 0$ for some $u^* \ne 0$ in $H_0$. Then for every $i$ we have

$$[w_i, u^*] = (Tw_i, Ku^*) = \lambda_i (Sw_i, Ku^*) = \lambda_i (Kw_i, Su^*) = 0.$$

This contradicts the assumption that $\{w_i\}$ is complete in $H_0$. Hence we must have $Su^* \ne 0$.
Sufficiency. Let us first observe that, since $D[T]$ is dense in $H_0$, to prove the completeness of $\{w_i\}$ in $H_0$ it is sufficient to show the validity of the expansion

$$u = \sum_{i=1}^{\infty} [u, w_i]\, w_i \tag{2.42}$$

for every $u$ in $D[T]$. Suppose that $Sw \ne 0$ whenever $w \in H_0$ and $w \ne 0$. Since $\{w_i\}$ is an orthonormal sequence in $D[T] \subset H_0$, the sequence $\big\{ \sum_{i=1}^{n} [u, w_i]\, w_i \big\}$ converges in the $H_0$-norm to some element $v$ in $H_0$; i.e.,

$$v = \sum_{i=1}^{\infty} [u, w_i]\, w_i.$$

Hence, by Lemma 1.4,

$$Sv = \sum_{i=1}^{\infty} [u, w_i]\, Sw_i. \tag{2.43}$$

Taking into account Lemma 2.6 we obtain from (2.43) the equality $S(v - u) = \sum_{i=1}^{\infty} [u, w_i]\, Sw_i - Su = 0$. Our assumption then implies that $v = u$; i.e., $\{w_i\}$ is complete in $H_0$.

Corollary 2.2. If $S$ is $H_0$-bounded and $Su \ne 0$ whenever $u \in H_0$ and $u \ne 0$, then the set $\{w_i\}$ of eigenelements determined by Theorem 2.6(a) is complete in $H$.

Proof. The proof follows from Theorem 2.7, the inequality (1.1), and the denseness of $H_0$ in $H$.

3. Generalized Moments Method

In this chapter we indicate briefly the applicability and the unifying property of the generalized moments method (GM-method) for the approximate solution of the eigenvalue problem

$$Tu - \lambda Su = 0, \tag{3.1}$$

where $T$ is K-p.d. and $S$ is such that $D_S \supseteq D_T$. This will allow us to extend the applicability of the GM-method to non-selfadjoint differential eigenvalue problems of an even and odd order.

3.1. Some general lemmas. To justify the GM-method we shall first discuss the relevant results concerning the eigenvalue problem

$$Nx - \mu x = 0, \tag{3.2}$$

where $N$ is a compact operator in some Hilbert space $X$.

Let $\{X_n\}$ be a projectionally complete sequence of expanding and finite dimensional subspaces of $X$ and let $P_n$ be the orthogonal projection of $X$ on $X_n$. By the approximate eigenvalues and eigenelements of $N$ we mean the eigenvalues $\mu^{(n)}$ and the corresponding eigenelements $w^{(n)} \in X_n$ of the operator $N_n = P_n N$; i.e., $\mu^{(n)}$ and $w^{(n)}$ are such that

$$N_n w^{(n)} - \mu^{(n)} w^{(n)} = 0. \tag{3.3}$$

Let us first note that, since $\{X_n\}$ is projectionally complete in $X$ and $N$ is compact, the sequence $\{N_n\}$ converges uniformly to $N$ [32]; i.e.,

$$\|N_n - N\| \to 0 \quad \text{as } n \to \infty. \tag{3.4}$$

The following lemmas due to Polsky [27] summarize the relevant properties of the eigenvalue problems (3.2) and (3.3).
Lemma 3.1. The set of eigenvalues of $N$ is identical with the set of limit points of all sequences of approximate eigenvalues.

Lemma 3.2. Let $\mu$ be an eigenvalue of $N$ and $\mu^{(n)}$ of $N_n$ such that $\mu^{(n)} \to \mu$. From every sequence of normed approximate eigenelements $\{w^{(n)}\}$ belonging to $\mu^{(n)}$ we can extract a convergent subsequence so that its limit $w$ is an eigenelement of $N$ belonging to $\mu$.

Remark 5. It was shown in [27] by an example that for an arbitrary compact operator it is not possible to obtain every eigenelement as a limit of approximate eigenelements. However, any element in an invariant subspace of $N$ can be obtained as a limit of a sequence of elements belonging to the corresponding approximate invariant subspaces. This implies that, in particular, if the eigenvalues $\mu$ of $N$ and their corresponding approximations $\mu^{(n)}$ of $N_n$ are simple, then any eigenelement of $N$ can be obtained as the limit of the corresponding approximate eigenelements of $N_n$.

It was already noted that Lemma 3.1 in this generality was first proved by Polsky [27] by means of the theory of analytic functions. For the sake of completeness and in view of the difficulty in obtaining his paper we prove below two slightly more general lemmas concerning a bounded, not necessarily compact, operator $N$ so that Lemma 3.1 will follow from them as a corollary.

Lemma 3.3. Let $\{N_n\}$ be a sequence of bounded operators on $X$ converging uniformly to a bounded operator $N$. If $\mu^{(n)} \in \sigma(N_n)$ and $\mu^{(n)} \to \mu^*$ as $n \to \infty$, then $\mu^* \in \sigma(N)$.

Proof. Suppose, contrary to the assertion, that $\mu^* \notin \sigma(N)$. Then $\mu^* \in \rho(N)$ and, since $\rho(N)$ is open, there is a neighborhood $U$ of $\mu^*$ belonging to $\rho(N)$ and a constant $c > 0$ such that $\|(\mu - N)x\| \ge c\|x\|$ for all $\mu$ in $U$ and $x$ in $X$. Hence

$$\|(\mu - N_n)x\| \ge \|(\mu - N)x\| - \|(N_n - N)x\| \ge (c - \|N - N_n\|)\,\|x\|, \quad \mu \in U, \ x \in X.$$

Since $\|N_n - N\| \to 0$ as $n \to \infty$, there is an integer $n_0 > 0$ such that $\|N_n - N\| \le \tfrac{1}{2} c$ for $n \ge n_0$. Therefore, $\|(\mu - N_n)x\| \ge \tfrac{1}{2} c \|x\|$ and $\|(\mu - N)^{-1}(N_n - N)x\| \le \tfrac{1}{2}\|x\|$ for all $\mu$ in $U$ and $x$ in $X$, so that for $n \ge n_0$ the equation $y = (\mu - N_n)x = (\mu - N)x + (N - N_n)x$ is uniquely solvable for each $\mu$ in $U$ and each $y$ in $X$. This is, however, impossible since $\mu^{(n)} \to \mu^*$ as $n \to \infty$. Thus, $\mu^* \in \sigma(N)$.

Lemma 3.4. If $\|N_n - N\| \to 0$ and $\mu^*$ is an isolated point of the spectrum $\sigma(N)$, then there exists $\mu^{(n)} \in \sigma(N_n)$ such that $\mu^{(n)} \to \mu^*$ as $n \to \infty$.

Proof. Assume, contrary to the assertion, that there is a subsequence $\{N_{n_i}\}$ of $\{N_n\}$, $n_1, n_2, \dots \to \infty$, and some neighborhood $U$ of $\mu^*$ such that $\sigma(N_{n_i}) \cap U = \emptyset$. Since $\mu^*$ is an isolated point of $\sigma(N)$ we can take $U$ so that it contains no other point of $\sigma(N)$ except $\mu^*$. Let $C$ be a circle with center $\mu^*$ and radius $\eta > 0$ so that $C$ lies in $U$. It is not hard to see that there is a constant $c_1 > 0$ such that

$$\|(\mu - N)x\| \ge c_1 \|x\|, \quad \mu \in C, \ x \in X. \tag{3.5}$$

Furthermore, since for $\mu \in C$ and $x \in X$

$$\|(\mu - N_{n_i})x\| \ge \|(\mu - N)x\| - \|(N - N_{n_i})x\| \ge (c_1 - \|N - N_{n_i}\|)\,\|x\|$$

and $\|N - N_{n_i}\| \to 0$, there is an integer $n_0 > 0$ such that for $n_i \ge n_0$

$$\|(\mu - N_{n_i})x\| \ge \tfrac{1}{2} c_1 \|x\|, \quad \mu \in C, \ x \in X. \tag{3.6}$$

This shows that the resolvents $R(\mu) = (\mu - N)^{-1}$ and $R_{n_i}(\mu) = (\mu - N_{n_i})^{-1}$ are uniformly bounded for all $\mu \in C$ and $n_i \ge n_0$. From the identity

$$R_{n_i}(\mu) - R(\mu) = R(\mu)\,(N_{n_i} - N)\,R_{n_i}(\mu), \quad \mu \in C,$$

and the inequalities (3.5) and (3.6) we obtain for all $n_i \ge n_0$ and $\mu \in C$

$$\|R_{n_i}(\mu) - R(\mu)\| \le \frac{2}{c_1^2}\, \|N - N_{n_i}\| \to 0 \tag{3.7}$$

as $n_i \to \infty$. The relation (3.7) shows that the sequence $\{R_{n_i}(\mu)\}$ of analytic functions converges uniformly on $C$ to the function $R(\mu)$. The properties of analytic functions imply that the resolvent $R(\mu)$, being a uniform limit on $C$ of $\{R_{n_i}(\mu)\}$, will also be analytic in the interior of $C$. This, however, contradicts the fact that $\mu^*$ is an isolated point of $\sigma(N)$, i.e., that $\mu^*$ is an isolated singular point of $R(\mu)$ which, as is known, is either a pole of $R(\mu)$ (in which case it is an eigenvalue of $N$ of some finite multiplicity) or an essential singularity of $R(\mu)$. This contradiction completes the proof of Lemma 3.4.

Remark 6. It is easy to see now that Lemma 3.1 is an immediate consequence of Lemma 3.3 and Lemma 3.4. Indeed, since $N$ is compact in $X$, its spectrum $\sigma(N)$ consists at most of a countable number of eigenvalues of finite multiplicity which are themselves isolated points of $\sigma(N)$; furthermore, the sequence of operators $\{N_n\} = \{P_n N\}$ converges uniformly to $N$. Thus, if $N$ is compact, the hypotheses of Lemma 3.3 and Lemma 3.4 are satisfied; their conclusions yield the validity of Lemma 3.1.

3.2. GM-method. In this section we consider the problem of the approximate solution of the eigenvalue problem (3.1) by means of the GM-method [26]. The essence of this method consists in the following. We choose a system of linearly independent elements $\varphi_i \in D_T$, $i = 1, 2, \dots$, which is complete in $H_0$, and then construct another system $K\varphi_i$.(14) The approximate eigenelement $w_k^{(n)}$ of (3.1) is taken in the form

$$w_k^{(n)} = \sum_{i=1}^{n} a_{ki}^{(n)} \varphi_i, \tag{3.9}$$

where $a_{k1}^{(n)}, a_{k2}^{(n)}, \dots, a_{kn}^{(n)}$ are determined from the condition that

$$(Tw_k^{(n)} - \lambda S w_k^{(n)}, K\varphi_j) = 0, \quad j = 1, 2, \dots, n. \tag{3.10}$$

(14) It is easy to see that in this case the system $K\varphi_i$, $i = 1, 2, \dots$, is complete in $H$. In fact, if $v$ is any element in $H$ such that $(v, K\varphi_i) = 0$, then by Theorem 1.1 there is an element $w \in D_T$ such that $(v, K\varphi_i) = (Tw, K\varphi_i) = [w, \varphi_i] = 0$ for $i = 1, 2, \dots, n, \dots$. Since $\{\varphi_i\}$ is complete in $H_0$, $w = 0$ and, consequently, $v = Tw = 0$; i.e., $\{K\varphi_i\}$ is complete in $H$. It is known [26] that the converse is also valid if $D_K = D_T$.

The condition (3.10) leads to the linear homogeneous algebraic system

$$\sum_{i=1}^{n} \{t_{ij} - \lambda s_{ij}\}\, a_{ki}^{(n)} = 0, \quad j = 1, 2, \dots, n, \tag{3.11}$$

with $\lambda$ as a parameter, where $t_{ij} = (T\varphi_i, K\varphi_j)$ and $s_{ij} = (S\varphi_i, K\varphi_j)$. The system (3.11) evidently admits a nontrivial solution for the $a_{ki}^{(n)}$'s if and only if

$$\det |t_{ij} - \lambda s_{ij}|_1^n = 0. \tag{3.12}$$

The vanishing of this determinant provides us with an algebraic equation of the n-th degree whose roots $\lambda_1^{(n)}, \lambda_2^{(n)}, \dots, \lambda_n^{(n)}$ will represent the approximations to the eigenvalues of (3.1). The coefficients $a_{ki}^{(n)}$ in (3.9) are then determined from the homogeneous system

$$\sum_{i=1}^{n} \{t_{ij} - \lambda_k^{(n)} s_{ij}\}\, a_{ki}^{(n)} = 0, \quad j = 1, 2, \dots, n. \tag{3.13}$$
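The algebraic side of (3.9)-(3.13) can be sketched in finite dimensions, where inner products are dot products and the coordinate elements $\varphi_i$ are columns of a matrix. The function name `gm_eigenvalues`, the matrix `Phi`, and the test operators below are illustrative assumptions, not objects from the text; the operators are built so that $K^{T}T$ and $K^{T}S$ are symmetric positive definite although $T$, $S$, and $K$ themselves are nonsymmetric.

```python
import numpy as np

def gm_eigenvalues(T, S, K, Phi):
    """Roots of det(t - lam * s) = 0, cf. (3.11)-(3.12).

    Columns of Phi play the role of phi_1, ..., phi_n (hypothetical setup).
    """
    KPhi = K @ Phi
    # Row j, column i: t_ij = (T phi_i, K phi_j), s_ij = (S phi_i, K phi_j).
    t = KPhi.T @ (T @ Phi)
    s = KPhi.T @ (S @ Phi)
    lam = np.linalg.eigvals(np.linalg.solve(s, t))
    return np.sort(lam.real)

rng = np.random.default_rng(1)
n = 6
A = 2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)   # symmetric positive definite
B = np.diag(np.arange(1.0, n + 1.0))                     # symmetric positive definite
K = np.eye(n) + np.triu(0.3 * rng.standard_normal((n, n)), k=1)
T = np.linalg.solve(K.T, A)    # then (Tu, Ku) = u.A.u
S = np.linalg.solve(K.T, B)    # then (Su, Ku) = u.B.u

exact = np.sort(np.linalg.eigvals(np.linalg.solve(S, T)).real)
Phi = np.eye(n)
# Smallest approximate eigenvalue for the nested subspaces spanned by phi_1..phi_m.
smallest = [gm_eigenvalues(T, S, K, Phi[:, :m])[0] for m in range(1, n + 1)]
full = gm_eigenvalues(T, S, K, Phi)
print(smallest, exact[0])
```

In this particular model the GM system coincides with a Rayleigh-Ritz system for the symmetric pair `A`, `B`, so the smallest approximate eigenvalue decreases toward the exact one as the subspace grows, and the full basis reproduces the exact spectrum.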

The following arguments offer the justification of the GM-method.

If we define the operator $N$ by setting $N = T^{-1}S$,(15) then the eigenvalues of $N$, being defined as those values of $\mu$ for which

$$Nu - \mu u = 0, \tag{3.14}$$

will be reciprocals $\mu = 1/\lambda$ of the eigenvalues $\lambda$ of our original problem (3.1), as is seen at once by operating on both sides of (3.1) with $T^{-1}$. However, the eigenelements $w$ of $N$, which by definition lie in $H_0$, will not in general satisfy (3.1) in the usual sense unless $w \in D_T$; for that it is sufficient that $w \in D_S$. Nevertheless, any eigenelement of $N$ will be considered as an eigenelement of (3.1) (if necessary, in the generalized sense). Considered in $H_0$, the condition (3.10) reduces to $[w_k^{(n)} - \lambda T^{-1}S w_k^{(n)}, \varphi_j] = 0$, $j = 1, 2, \dots, n$, while (3.11) can be written in the form

$$\sum_{i=1}^{n} \{s_{ij} - \mu t_{ij}\}\, a_{ki}^{(n)} = 0, \tag{3.15}$$

where $s_{ij} = [N\varphi_i, \varphi_j]$ and $t_{ij} = [\varphi_i, \varphi_j]$, which is precisely the system one obtains when the method is applied to equation (3.14) in the space $H_0$.

(15) Let us note that the domain of definition of $T^{-1}S$ is equal to $D_S$ and, therefore, if $D_S \subseteq H_0$, then by $N$ we shall also denote the extension of $T^{-1}S$ to all of $H_0$, while in case $D_S \supseteq H_0$ we will use $N$ to denote the restriction of $T^{-1}S$ to $H_0$.

Let $\{\psi_i\}$ denote the set of elements in $D_T$ obtained by orthonormalizing the sequence $\{\varphi_i\}$ in the $H_0$-metric. It was shown in [26] that the orthonormalization process leads to one and the same result, i.e., $w_k^{(n)}$ will not be changed. However, if we take $w_k^{(n)}$ in the form $w_k^{(n)} = \sum_{i=1}^{n} d_{ki}^{(n)} \psi_i$, the system (3.15) is then replaced by its equivalent and more convenient form

$$\sum_{i=1}^{n} \{\gamma_{ij} - \mu\, \delta_{ij}\}\, d_{ki}^{(n)} = 0, \quad j = 1, 2, \dots, n, \tag{3.16}$$

where $\gamma_{ij} = [N\psi_i, \psi_j]$, $i, j = 1, 2, \dots, n$.

Let $H_n$ be the subspace of $H_0$ determined by $\psi_1, \psi_2, \dots, \psi_n$ and $P_n$ the orthogonal projection of $H_0$ onto $H_n$. Since $\{\psi_i\}$ is complete in $H_0$, the sequence $\{H_n\}$ is projectionally complete in $H_0$. It is easy to see that the equation

$$P_n N w_k^{(n)} - \mu w_k^{(n)} = 0 \tag{3.17}$$

is equivalent to the system (3.16), i.e., the applicability of the GM-method to (3.14) is equivalent to determining $a_{k1}^{(n)}, \dots, a_{kn}^{(n)}$ from equation (3.17). It is now easy to prove the following theorem.


Theorem 3.1. If $T^{-1}S$ can be extended to a compact operator $N$ in $H_0$, then

(a) The sequence $\{\lambda^{(n)}\}$ of approximate eigenvalues determined by the GM-method (3.11) converges to the eigenvalue $\lambda$ of (3.1) or $\mu = 1/\lambda$ of $N$.

(b) The sequence $\{w^{(n)}\}$ of the corresponding normalized approximate eigenelements contains a subsequence converging to an eigenelement $w$ of $N$ belonging to $\mu = 1/\lambda$.

(c) Assuming additionally that $S$ is K-positive and the sequences $\{\lambda_k^{(n)}\}$ and $\{\lambda_k\}$ of approximate and exact eigenvalues are arranged in the order of increasing magnitude, each as many times as its multiplicity indicates, then $\{\lambda_k^{(n)}\}$ converges to $\lambda_k$ as $n \to \infty$.

Proof. Since $N$ is compact in $H_0$ and $\{H_n\}$ is projectionally complete in $H_0$, the sequence of operators $\{N_n\} = \{P_n N\}$ converges uniformly to $N$ in $H_0$. In view of this and the discussion preceding Theorem 3.1 and the fact that $\sigma(N) \supseteq \sigma(T^{-1}S)$, the assertion of Theorem 3.1(a) follows immediately from Lemma 3.1.

To prove Theorem 3.1(b) note that the sequence $\{w^{(n)}\}$, being bounded, is weakly compact. Let us use $\{w^{(n)}\}$ also to denote its subsequence which converges weakly to some element $w$ in $H_0$. Evidently the sequence $\{\mu^{(n)} w^{(n)}\}$, where $\mu^{(n)} = 1/\lambda^{(n)}$, converges weakly to $\mu w$ and, since $N$ is compact, $\{Nw^{(n)}\}$ converges strongly to $Nw$. Hence, in view of this and (3.3), the equality

$$Nw - \mu^{(n)} w^{(n)} = Nw - N_n w^{(n)} = Nw - Nw^{(n)} + Nw^{(n)} - N_n w^{(n)}$$

and the uniform convergence of $N_n$ to $N$ in $H_0$ imply that

$$|Nw - \mu^{(n)} w^{(n)}| \le |Nw - Nw^{(n)}| + |N - N_n| \to 0$$

as $n \to \infty$; i.e., $\mu^{(n)} w^{(n)} \to Nw$ in the $H_0$-norm. Since $\mu^{(n)} \to \mu \ne 0$, it follows that $w^{(n)}$ converges strongly to $w$, so that $|w| = 1$. Consequently, $Nw = \mu w$, as desired.

In order to prove (c) note that, since $S$ is K-positive, the operator $N$ is selfadjoint and positive in $H_0$. So if we characterize the eigenvalues of $N$ as extreme values of the quotient $[Nu, u]/[u, u]$, using the recursive characterization, we have all the information we need to establish Theorem 3.1(c). In fact, we have for all normalized elements $u$ orthogonal to the first $(k-1)$ eigenelements of $N$

$$\mu_k^{(n)} \le \max\, [P_n N P_n u, P_n u] = \max \{ [Nu, u] + [P_n N P_n u, u] - [Nu, u] \}$$
$$\le \mu_k + \max |[(P_n N P_n - N)u, u]| \le \mu_k + |P_n N P_n - N|.$$

In the same way we obtain the inequality

$$\mu_k \le \max\, [Nu, u] = \max \{ [P_n N P_n u, u] + [Nu, u] - [P_n N P_n u, u] \} \le \mu_k^{(n)} + |N - P_n N P_n|.$$

Thus, $|\mu_k^{(n)} - \mu_k| \le |N - P_n N P_n| \to 0$, as desired. This completes the proof of Theorem 3.1.
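The final estimate $|\mu_k^{(n)} - \mu_k| \le |N - P_n N P_n|$ can be observed numerically on a finite surrogate. In the sketch below the symmetric matrix `N` with rapidly decaying entries stands in for a compact selfadjoint operator, and `P_n` projects onto the first `n` coordinates; both choices are assumptions made purely for illustration.

```python
import numpy as np

m = 60                                    # size of the finite surrogate for N
idx = np.arange(m)
# Symmetric matrix with decaying entries 1 / (1 + i + j)^2 (hypothetical choice).
N = 1.0 / (1.0 + idx[:, None] + idx[None, :]) ** 2

mu = np.sort(np.linalg.eigvalsh(N))[::-1]            # eigenvalues, decreasing

def compressed(N, n):
    """Return P_n N P_n, with P_n the projection onto the first n coordinates."""
    M = np.zeros_like(N)
    M[:n, :n] = N[:n, :n]
    return M

rows = []
for n in (2, 4, 8, 16, 32):
    Nn = compressed(N, n)
    mu_n = np.sort(np.linalg.eigvalsh(Nn))[::-1]     # includes the m - n zero eigenvalues
    gap = np.linalg.norm(N - Nn, 2)                  # operator norm |N - P_n N P_n|
    err = np.max(np.abs(mu_n - mu))                  # max over k of |mu_k^(n) - mu_k|
    rows.append((n, err, gap))

for n, err, gap in rows:
    print(n, err, gap)
```

For each `n` the eigenvalue error stays below the operator-norm gap, and the gap itself shrinks as `n` grows, which is the mechanism used in the proof of part (c).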

Remark 7. Theorem 3.1 shows that the applicability of the GM-method for the approximate calculation of the eigenvalues and eigenelements of the problem (3.1) depends essentially on the complete continuity of the operator $T^{-1}S$ in $H_0$. This underlines the usefulness of the discussion in Section 1.4 which deals with various conditions under which the operator $T^{-1}S$ can be verified to be compact in $H_0$.

Furthermore, Theorem 3.1 allows us to formulate the eigenvalue problem for non-selfadjoint differential and integrodifferential equations of an even and odd order. For further discussion of this topic as well as for the statement of Theorem 3.1(a) see [23, 24].

3.3. Special cases. In this section we show that, by specifying the operators $K$ and $S$, the most important direct methods used in the approximate solution of the eigenvalue problem (3.1) can be deduced as special cases of the GM-method. In each of these special cases Theorem 3.1 remains valid provided, of course, that the operators satisfy the indicated conditions. This will always be assumed to be the case.

(i) Ordinary Ritz method. If $K = I$ and $S = I$, then $T$ is selfadjoint and positive definite on $D_T$, $H_0$ is the completion of $D_T$ in the metric $[u, v] = (Tu, v)$, and the GM-method reduces in this case to the ordinary Ritz method

$$\sum_{i=1}^{n} \{(T\varphi_i, \varphi_j) - \lambda(\varphi_i, \varphi_j)\}\, a_{ki}^{(n)} = 0 \tag{3.11$_{\text{i}}$}$$

which has been extensively studied and used in various applications [4, 10, 25, 27].

(ii) Generalized Ritz method. In this case we take $S = I$ so that $T$ is K-p.d. and the GM-method becomes

$$\sum_{i=1}^{n} \{(T\varphi_i, K\varphi_j) - \lambda(\varphi_i, K\varphi_j)\}\, a_{ki}^{(n)} = 0. \tag{3.11$_{\text{ii}}$}$$

In particular, when the operator $T$ is selfadjoint and positive definite we may take $K$ to be some root of $T$.

(iii) Galerkin method. If $K = I$, then $T$ is selfadjoint and positive definite so that the GM-method reduces to the well-known [4, 25, 27] Galerkin method determined by the system

$$\sum_{i=1}^{n} \{(T\varphi_i, \varphi_j) - \lambda(S\varphi_i, \varphi_j)\}\, a_{ki}^{(n)} = 0. \tag{3.11$_{\text{iii}}$}$$

The space $H_0$ is here the same as in the case (i).

(iv) Moments method. If $K = T$, then $T$ is continuously invertible, $H_0$ is the set $D_T$ with the metric $[u, v] = (Tu, Tv)$, and the GM-method reduces to the ordinary method of moments

$$\sum_{i=1}^{n} \{(T\varphi_i, T\varphi_j) - \lambda(S\varphi_i, T\varphi_j)\}\, a_{ki}^{(n)} = 0. \tag{3.11$_{\text{iv}}$}$$

Since in this case $H_0 = D_T$, we see that $N = T^{-1}S$ and the eigenelements of $N$ actually satisfy the equation (3.1) in the ordinary sense.
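The reductions (i), (iii), and (iv) amount to different choices of $K$ in one and the same assembly routine. The following sketch illustrates this with matrices; the helper names `gm_matrices` and `gm_eigenvalues` and the test operators are hypothetical, chosen only so that the conditions of each case hold ($T$ symmetric positive definite, $S$ symmetric positive definite).

```python
import numpy as np

def gm_matrices(T, S, K, Phi):
    """Return (t, s) with t[j, i] = (T phi_i, K phi_j), s[j, i] = (S phi_i, K phi_j)."""
    KPhi = K @ Phi
    return KPhi.T @ (T @ Phi), KPhi.T @ (S @ Phi)

def gm_eigenvalues(T, S, K, Phi):
    """Roots of det(t - lam * s) = 0 for the chosen K and coordinate elements."""
    t, s = gm_matrices(T, S, K, Phi)
    return np.sort(np.linalg.eigvals(np.linalg.solve(s, t)).real)

n = 8
I = np.eye(n)
T = 2.0 * I - np.eye(n, k=1) - np.eye(n, k=-1)   # symmetric positive definite
S = np.diag(np.linspace(1.0, 2.0, n))            # symmetric positive definite
Phi = I[:, :4]                                   # first four coordinate elements

t_ritz, s_ritz = gm_matrices(T, I, I, Phi)   # (i)   K = I, S = I
t_gal, s_gal = gm_matrices(T, S, I, Phi)     # (iii) K = I
t_mom, s_mom = gm_matrices(T, S, T, Phi)     # (iv)  K = T

exact = np.sort(np.linalg.eigvals(np.linalg.solve(S, T)).real)
galerkin_full = gm_eigenvalues(T, S, I, I)
moments_full = gm_eigenvalues(T, S, T, I)
print(exact, galerkin_full, moments_full)
```

With a full set of coordinate elements both the Galerkin and the moments systems reproduce the spectrum of $Tu - \lambda Su = 0$; with fewer elements they generally give different approximations, since the moments system uses $(T\varphi_i, T\varphi_j)$ in place of $(T\varphi_i, \varphi_j)$.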

Let us point out at the end of this chapter that, in view of the wide freedom in the choice of $K$ and the generality of $S$, the GM-method can be applied to a much larger class of problems than any of the methods mentioned above which it unifies. Furthermore, when applied to the differential eigenvalue problems the GM-method will give a better character of convergence than the methods of Galerkin or Ritz.
4. A General Iterative Method

The purpose of this chapter is to present and investigate a fairly general iterative method for the approximate solution of the eigenvalue problem

$$Tu - \lambda Su = 0, \tag{4.1}$$

where $T$ is K-p.d. and $S$ is K-p. relative to $D_T$ with $D_S \supseteq D_T$; i.e.,

$$(Su, Ku) > 0, \quad u \in D_T. \tag{4.2}$$

It will also be shown that the method, which is developed here for the problem (4.1) with unbounded operators $T$ and $S$, unifies various existing iterative schemes and extends their applicability to the class of problems (4.1), and that at the same time it generalizes some of the results obtained by a number of authors [1, 6, 11, 13, 15, 17, 20, 22, 29] mostly for positive definite symmetric matrices and bounded operators. It is hoped that this investigation will at least partly fill the gap indicated by Kantorovich [16] and at the same time form the basis for the possible discovery of new and more effective schemes when specified to a particular class of problems.

4.1. The formulation of the method. In this section we formulate the method and derive some of its properties. Let us assume at the outset that(16) there is given a well-known K-p.d. operator $C$ and constants $M_1 \ge M_2 > 0$ such that $D_C = D_T$,

$$M_2 (Cu, Ku) \le (Tu, Ku) \le M_1 (Cu, Ku), \quad u \in D_T, \tag{4.3}$$

and for every $r$ in $H$ of the form $r = Tu - \lambda Su$, $u \in D_T$, the equation

$$Ch = r \tag{4.4}$$

is uniquely and relatively easily solvable for $h$. It was observed before that if $\lambda$ is an eigenvalue of (4.1) and $u \in D_T$ a corresponding eigenvector, then it follows from (4.1) that

$$\lambda = \lambda(u) = \frac{(Tu, Ku)}{(Su, Ku)}. \tag{4.5}$$

This implies that the eigenvalue problem (4.1) is equivalent to the problem of finding solutions of the operator equation

$$Tu - \lambda(u) Su = 0, \quad u \in D_T. \tag{4.6}$$

We present a method for finding the solutions of (4.1) which is based on this observation and which avoids the transformation of the problem.

(16) Since our concern in this chapter is to develop a practical procedure for the approximate calculation of eigenvalues and eigenvectors of (4.1) we assume, for the sake of simplicity, that the operators $T$, $S$, and $C$ satisfy rather strong conditions. However, later we will indicate how these conditions may be somewhat relaxed. As a matter of fact, most of the results in the first few sections are also valid when $T$ and $C$ are only K-p. Furthermore, it will always be assumed in this chapter that the eigenvalue problem (4.1) possesses eigenvalues with corresponding eigenvectors which have the properties used below. If necessary, this can always be attained by making recourse to Chapter 2.

To solve (4.6) we use the following iteration scheme. If $u_0 \ne 0$ is an arbitrary initial approximation belonging to $D_T$ and $u_i$ is the iterant obtained at the $i$-th step of our process, then the succeeding iterant $u_{i+1}$ is taken in the form

$$u_{i+1} = u_i - t_i C^{-1} r_i, \qquad i = 0,1,2,\ldots, \tag{4.7}$$

or

$$u_{i+1} = u_i - t_i h_i, \qquad i = 0,1,2,\ldots, \tag{4.8}$$

where $r_i$ denotes the residual at $u_i$; i.e.,

$$r_i = Tu_i - \lambda^i Su_i, \qquad \lambda^i = \lambda(u_i), \qquad i = 0,1,2,\ldots, \tag{4.9}$$

the element $h_i$ is the solution of the equation

$$Ch_i = r_i, \qquad i = 0,1,2,\ldots, \tag{4.10}$$

and $t_i$, $i = 0,1,2,\ldots$, are real numbers to be determined by some process.
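As an illustration only, the scheme (4.8)-(4.10) can be sketched for finite real symmetric matrices. The sketch assumes $K = I$, a diagonal $C$ (so that $Ch = r$ is solved componentwise), and a fixed step $t$; the function names and the sample matrices are hypothetical and do not come from the text.

```python
from math import sqrt

def matvec(A, x):
    """Matrix-vector product for a matrix given as a list of rows."""
    return [sum(a * xj for a, xj in zip(row, x)) for row in A]

def dot(x, y):
    return sum(a * b for a, b in zip(x, y))

def rayleigh(T, S, u):
    """lambda(u) = (Tu, Ku) / (Su, Ku), here with K = I (assumption)."""
    return dot(matvec(T, u), u) / dot(matvec(S, u), u)

def iterate(T, S, c_diag, u0, t, steps):
    """Run (4.8)-(4.10) with a fixed step t and C = diag(c_diag)."""
    u = list(u0)
    lams = [rayleigh(T, S, u)]
    for _ in range(steps):
        lam = lams[-1]
        Tu, Su = matvec(T, u), matvec(S, u)
        r = [a - lam * b for a, b in zip(Tu, Su)]   # residual (4.9)
        h = [ri / ci for ri, ci in zip(r, c_diag)]  # solve C h = r, (4.10)
        u = [ui - t * hi for ui, hi in zip(u, h)]   # update (4.8)
        lams.append(rayleigh(T, S, u))
    return u, lams

# Hypothetical test problem: T tridiagonal, S = C = I.
T = [[2.0, -1.0, 0.0], [-1.0, 2.0, -1.0], [0.0, -1.0, 2.0]]
S = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
u, lams = iterate(T, S, [1.0, 1.0, 1.0], [1.0, 0.0, 0.0], t=0.5, steps=200)
lam_min = 2.0 - sqrt(2.0)  # smallest eigenvalue of this particular T
```

For this sample the step $t = 0.5$ lies below $2/M_1$ with $M_1 = 2 + \sqrt{2}$, the range that Lemma 4.1 associates with a decreasing sequence $\{\lambda^i\}$.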

Observe that it is not necessary to construct the inverse operator $C^{-1}$ since, as is the case in our method, it is sufficient to know the solutions of $Ch = r$ for special $r$ in $H$. Let us note that whenever $u_i$ is a nonzero iterant in $D_T$ then $u_{i+1}$ is also a nonzero iterant belonging to $D_T$ so that (4.8) determines a well-defined sequence $\{u_i\}$ in $D_T$. In fact, if $u_i \ne 0$ and $u_i \in D_T$, then by (4.6), (4.9), and (4.10)

$$(Ch_i, Ku_i) = 0, \qquad i = 0,1,2,\ldots, \tag{4.11}$$

and, consequently, by (4.8)

$$(Cu_{i+1}, Ku_{i+1}) = (Cu_i, Ku_i) + t_i^2\,(Ch_i, Kh_i), \qquad i = 0,1,2,\ldots. \tag{4.12}$$

Since $C$ is K-p.d. and $D_C = D_T$, the relations (4.12), (4.10), and (4.8) show that $u_{i+1} \ne 0$ and that $u_{i+1} \in D_T$ whenever $u_i \ne 0$ and $h_i \ne 0$. The last condition may always be assumed to be satisfied for, if $h_i = 0$ for any $i$, then $u_i$ is an eigenvector of (4.1) and so a solution is obtained.

Remark 8. Before we proceed let us note that the method (4.7)-(4.10) is analogous to the scheme presented by the author [26] for the solution of nonhomogeneous operator equations in $H$. Furthermore, it is not hard to see that, in view of (4.3), the element $h_i$ determined by (4.10) is equal in direction to the gradient of the functional $\lambda(u)$ at $u = u_i$ relative to the metric in the space $H_2$, where $H_2$ is the completion of $D_T$ in the metric

$$[u,v]_2 = (Cu,Kv), \qquad |u|_2^2 = [u,u]_2, \qquad u, v \in D_T. \tag{4.13}$$

Hence we see that under present conditions the method is analogous to some sort of a gradient method and as such it is an extension of the procedure developed in [11] for the solution of (4.1) with finite selfadjoint and positive definite matrices.
For the sake of completeness let us compute the gradient of $\lambda(u)$ in $H_2$. Since

$$\operatorname{grad}\lambda(u) = \frac{1}{(Su,Ku)}\bigl[\operatorname{grad}(Tu,Ku) - \lambda(u)\operatorname{grad}(Su,Ku)\bigr],$$

it is sufficient to find $\operatorname{grad}(Tu,Ku)$ and $\operatorname{grad}(Su,Ku)$ in $H_2$. To compute $\operatorname{grad}(Tu,Ku)$ take fixed vectors $u$ and $h$ in $D_T$ and consider the function $f(t) = (T(u+th), K(u+th))$ for real $t$. A simple calculation shows that

$$\left.\frac{df}{dt}\right|_{t=0} = 2\operatorname{Re}(Tu,Kh)$$

and

$$|\operatorname{Re}(Tu,Kh)| \le |(Tu,Kh)| = \bigl|[C^{-1}Tu, h]_2\bigr| \le |C^{-1}Tu|_2\,|h|_2.$$

Hence, for a fixed $|h|_2$ the equality is attained for such $h$ for which $h$ is proportional to $C^{-1}Tu$. Thus, in the $H_2$-metric, $\operatorname{grad}(Tu,Ku) = 2C^{-1}Tu$. Similarly, $\operatorname{grad}(Su,Ku) = 2C^{-1}Su$ (in $H_2$). Combining this with our formula for $\operatorname{grad}\lambda(u)$ (in $H_2$) we get the formula

$$\operatorname{grad}\lambda(u) = \frac{2}{(Su,Ku)}\,C^{-1}\{Tu - \lambda(u)Su\} \quad (\text{in } H_2);$$

i.e., $\operatorname{grad}\lambda(u) = \dfrac{2}{(Su,Ku)}\,h$, where $h$ is determined by $Ch = Tu - \lambda(u)Su$.

We first show that under very general conditions on $\{t_i\}$ the sequence $\{\lambda^i\}$ converges. To find these conditions we consider the change

$$\Delta\lambda = \lambda(u) - \lambda(u - th), \qquad h \ne 0, \tag{4.14}$$

where for notational reasons we omitted "$i$" on $u$, $h$, and $t$. In view of (4.8), a simple calculation shows that

$$\begin{aligned}
\Delta\lambda &= \frac{-2t\lambda(u)\operatorname{Re}(Sh,Ku) + t^2\lambda(u)(Sh,Kh) + 2t\operatorname{Re}(Tu,Kh) - t^2(Th,Kh)}{(Su,Ku) - 2t\operatorname{Re}(Sh,Ku) + t^2(Sh,Kh)} \\
&= \frac{t^2\lambda(u)(Sh,Kh) + 2t(Ch,Kh) - t^2(Th,Kh)}{(Su,Ku) - 2t\operatorname{Re}(Sh,Ku) + t^2(Sh,Kh)} \\
&= d\,\frac{a\lambda(u)t^2 + 2t - et^2}{1 - 2bt + ct^2} = d\,p(u,t),
\end{aligned} \tag{4.15}$$

where we define

$$d = \frac{(Ch,Kh)}{(Su,Ku)}, \qquad p(u,t) = \frac{2t - ast^2}{1 - 2bt + ct^2}, \tag{4.16}$$

with

$$a = \frac{(Sh,Kh)}{(Ch,Kh)}, \quad b = \frac{\operatorname{Re}(Sh,Ku)}{(Su,Ku)}, \quad c = \frac{(Sh,Kh)}{(Su,Ku)}, \quad e = \frac{(Th,Kh)}{(Ch,Kh)}, \quad s = \lambda(h) - \lambda(u). \tag{4.17}$$

Formula (4.15) shows that for a given $u$ in $D_T$ the change $\Delta\lambda$ depends on the behavior of $p(u,t)$, as a function of $t$, when $t$ varies through the entire real axis from $-\infty$ to $+\infty$, so that the mode and the rate of convergence of the sequence $\{\lambda^i\}$ will depend on $p(u,t)$.
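The identity $\Delta\lambda = d\,p(u,t)$ of (4.15) can be checked numerically on a small example. The sketch below assumes real symmetric $2 \times 2$ matrices, $K = I$, and a diagonal $C$; the matrices, the vector $u$, the step $t$, and all function names are hypothetical choices made only for the check.

```python
def matvec(A, x):
    return [sum(a * xj for a, xj in zip(row, x)) for row in A]

def dot(x, y):
    return sum(a * b for a, b in zip(x, y))

def form(A, x, y):
    """(Ax, Ky) with K = I (assumption)."""
    return dot(matvec(A, x), y)

# Hypothetical data: T, S symmetric positive definite, C diagonal.
T = [[4.0, 1.0], [1.0, 3.0]]
S = [[2.0, 0.5], [0.5, 1.0]]
c_diag = [3.0, 2.0]
u = [1.0, 2.0]
t = 0.1

lam_u = form(T, u, u) / form(S, u, u)
r = [x - lam_u * y for x, y in zip(matvec(T, u), matvec(S, u))]  # (4.9)
h = [ri / ci for ri, ci in zip(r, c_diag)]                        # (4.10)
ch_kh = dot(r, h)                        # (Ch, Kh), since Ch = r

# Quantities of (4.16) and (4.17).
d = ch_kh / form(S, u, u)
a = form(S, h, h) / ch_kh
b = form(S, h, u) / form(S, u, u)
c = form(S, h, h) / form(S, u, u)
s = form(T, h, h) / form(S, h, h) - lam_u
p = (2.0 * t - a * s * t * t) / (1.0 - 2.0 * b * t + c * t * t)

# Left-hand side of (4.14), computed directly.
u_new = [ui - t * hi for ui, hi in zip(u, h)]
delta = lam_u - form(T, u_new, u_new) / form(S, u_new, u_new)
```

Here `delta` and `d * p` should agree up to rounding, and `dot(r, u)` should vanish in accordance with (4.11).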

4.2. The graph of $p(u,t)$. To graph $p$ note that $p = 0$ when $t = 0$ and $t = 2/as$. Moreover, since $1 - 2bt + ct^2 = |u - th|_1^2/|u|_1^2 > 0$ for all $t$, it is clear that $p$ is a continuous function of $t$ for all real $t$ including $t = \pm\infty$ for, in the latter case, $\lim_{t \to -\infty} p(u,t) = \lim_{t \to \infty} p(u,t) = -as/c$. Let us add that $p = -as/c$ also for $t = as/2(abs - c)$. Furthermore, a simple calculation shows that

$$\frac{dp}{dt} = 2\,\frac{(abs - c)t^2 - ast + 1}{(1 - 2bt + ct^2)^2}. \tag{4.18}$$

Hence $dp/dt = 0$ for those values of $t$ for which

$$(abs - c)t^2 - ast + 1 = 0. \tag{4.19}$$

Since $p = -as/c$ for $t = \pm\infty$, the equation $dp/dt = 0$ has at least one nonzero real root. In fact, it is not hard to see that it has two distinct roots



$\tilde\theta$ and $\theta$ given by

$$\tilde\theta = \frac{sa + \sqrt{(sa)^2 - 4(abs - c)}}{2(abs - c)}, \qquad \theta = \frac{sa - \sqrt{(sa)^2 - 4(abs - c)}}{2(abs - c)}. \tag{4.20}$$

Indeed, if $\theta'$ is a solution of (4.19), then $abs - c = \dfrac{as\theta' - 1}{\theta'^2}$ and

$$\theta'^2\bigl[(sa)^2 - 4(abs - c)\bigr] = (as\theta')^2 - 4\theta'^2(abs - c) = (as\theta')^2 - 4(as\theta' - 1) = (as\theta' - 2)^2 \tag{4.21}$$

with

$$p(u,\theta') = \frac{(2 - as\theta')\theta'}{(2 - as\theta')(1 - b\theta')} = \frac{\theta'}{1 - b\theta'}, \tag{4.22}$$

for the fact that the denominator of $p$ is positive for all $t$ implies that $2 - as\theta' \ne 0$. This and (4.21) imply that the discriminant $(sa)^2 - 4(abs - c) > 0$. Hence the equation (4.19) has two distinct real roots given by (4.20) or equivalently by

$$\tilde\theta = \frac{2}{sa - \sqrt{(sa)^2 - 4(abs - c)}}, \qquad \theta = \frac{2}{sa + \sqrt{(sa)^2 - 4(abs - c)}}. \tag{4.23}$$

The above discussion shows that the graph of $p(u,t)$ depends on the algebraic sign of $(abs - c)$ and $s$. Hence the following five cases are sufficient to describe it.
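The stationary points (4.23) and the value formula (4.22) can be verified numerically for sample coefficients. In the sketch below the numbers $a$, $b$, $c$, $s$ are hypothetical (chosen with $b^2 < c$, $s > 0$, and $abs - c < 0$) and the function names are illustrative only.

```python
from math import sqrt

def p(t, a, b, c, s):
    """p(u, t) of (4.16) for given scalar coefficients a, b, c, s."""
    return (2.0 * t - a * s * t * t) / (1.0 - 2.0 * b * t + c * t * t)

def stationary_points(a, b, c, s):
    """Return (theta_tilde, theta) from (4.23); assumes nonzero denominators."""
    root = sqrt((s * a) ** 2 - 4.0 * (a * b * s - c))
    return 2.0 / (s * a - root), 2.0 / (s * a + root)

# Hypothetical sample coefficients.
a, b, c, s = 1.0, 0.2, 0.5, 1.5
theta_tilde, theta = stationary_points(a, b, c, s)

# Compare against a coarse grid of t values in [-100, 100].
grid = [k / 10.0 for k in range(-1000, 1001)]
p_max = p(theta, a, b, c, s)
p_min = p(theta_tilde, a, b, c, s)
```

For these coefficients `p_max` should dominate and `p_min` should bound from below every grid value of `p`, and both should match $\theta'/(1 - b\theta')$ from (4.22).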




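Case 1. If $s = 0$, then $p = \dfrac{2t}{1 - 2bt + ct^2}$, $\lim_{t \to -\infty} p(u,t) = \lim_{t \to \infty} p(u,t) = 0$, $\tilde\theta = -(1/c)^{1/2}$, $\theta = (1/c)^{1/2}$, $\min_t p = p(u,\tilde\theta) = \dfrac{-1}{\sqrt{c} + b} < 0$ and $\max_t p = p(u,\theta) = \dfrac{1}{\sqrt{c} - b} > 0$. The graph of $p$ in this case is given by Fig. 1.

Case 2i. If $s > 0$ and $abs - c > 0$, then it is not hard to see that in this case $0 < \theta < \tau' < \tau'' < \tilde\theta$, where $\tau' = \dfrac{2}{as}$, $\tau'' = \dfrac{as}{2(abs - c)}$, $\theta$ and $\tilde\theta$ are given by (4.23); and $\min_t p = p(u,\tilde\theta) = \dfrac{\tilde\theta}{1 - b\tilde\theta} < 0$ and $\max_t p = p(u,\theta) = \dfrac{\theta}{1 - b\theta} > 0$. The graph of $p$ is given by Fig. 2i.

Case 2j. If $s > 0$ and $abs - c < 0$, then $\tilde\theta < \tau'' < 0 < \theta < \tau'$, $\min_t p = p(u,\tilde\theta) = \dfrac{\tilde\theta}{1 - b\tilde\theta} < 0$, and $\max_t p = p(u,\theta) = \dfrac{\theta}{1 - b\theta} > 0$. The graph of $p(u,t)$ is given by Fig. 2j.

Case 3i. If $s < 0$ and $abs - c > 0$, then $\theta < \tau'' < \tau' < \tilde\theta < 0$, $\min_t p = p(u,\tilde\theta) = \dfrac{\tilde\theta}{1 - b\tilde\theta} < 0$, and $\max_t p = p(u,\theta) = \dfrac{\theta}{1 - b\theta} > 0$ with Fig. 3i representing the graph of $p$.

Case 3j. If $s < 0$ and $abs - c < 0$, then $\tau' < \tilde\theta < 0 < \tau'' < \theta$ with $\min_t p = p(u,\tilde\theta) = \dfrac{\tilde\theta}{1 - b\tilde\theta} < 0$, $\max_t p = p(u,\theta) = \dfrac{\theta}{1 - b\theta} > 0$, and the graph given by Fig. 3j.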

Remark 9. We see from the graphs that, for a given $u$ in $D_T$, the function $p(u,t)$ attains its maximum value, which is positive, at the second root $\theta$ of equation (4.19). This remark will be useful when we discuss the choice of the optimum parameter.
[Figs. 1, 2i, 2j, 3i, 3j. Graphs of $p(u,t)$ in Cases 1, 2i, 2j, 3i, and 3j.]
4.3. The convergence of the method. Since $d > 0$, it is seen from (4.15) and the graphs of $p(u,t)$ that, for a given $u$, $\Delta\lambda > 0$ for all $t$ which are such that $t \in (0,\infty)$ when $s = 0$ (Case 1), $t \in (0,\tau')$ when $s > 0$ (Case 2), and $t \in C[\tau',0]$ when $s < 0$ (Case 3), where $C[\tau',0]$ denotes the complement of $[\tau',0]$ in the set $R$ of real numbers.

The above remark indicates that at each step of the process the range of $t$ for which $\Delta\lambda > 0$ depends upon the iterant $u$. Let us first show that for a given eigenvalue problem with fixed operators $T$, $S$, $K$, and $C$ we can determine an interval of variation of $t$ for which $\Delta\lambda > 0$ independently of $u$.

Lemma 4.1. If $T$, $S$, $K$, and $C$ satisfy conditions of Section 4.1, then $\Delta\lambda > 0$, independently of $u$ in $D_T$, for any $t$ in the interval

$$0 < t < \frac{2}{M_1}, \tag{4.24}$$

where $M_1$ is the constant determined by (4.3). If in addition we assume that $D_S = D_T$ and that there exist two constants $m_1 > 0$ and $\ell_1 > 0$ such that

$$(Tu,Ku) \ge m_1\,(Su,Ku), \qquad u \in D_T, \tag{4.25}$$

and

$$(Su,Ku) \ge \ell_1\,(Cu,Ku), \qquad u \in D_T, \tag{4.26}$$

then $\Delta\lambda > 0$, independently of $u$, for any $t$ in the interval

$$0 < t < \frac{2}{M_1 - m_1\ell_1}. \tag{4.27}$$


Proof. We have seen above that, when $s \le 0$, $\Delta\lambda > 0$ for any $t > 0$ and that, when $s > 0$, $\Delta\lambda > 0$ for $t$ in $0 < t < \tau'$. Thus, to prove the first part of Lemma 4.1, it is sufficient to estimate the quantity $\tau'$ in the latter case. Since, in virtue of (4.16) and (4.17),

$$0 < \frac{2}{\tau'} = as = \frac{(Th,Kh)}{(Ch,Kh)} - \frac{(Sh,Kh)}{(Ch,Kh)}\,\frac{(Tu,Ku)}{(Su,Ku)} < \frac{(Th,Kh)}{(Ch,Kh)}, \tag{4.28}$$

we obtain from (4.28) and (4.3) the inequality $0 < 2/\tau' < M_1$ or $\tau' > 2/M_1$ valid for all $u$ in $D_T$. This is precisely (4.24).

To prove the second part of Lemma 4.1 note that when our operators satisfy the additional conditions (4.25) and (4.26) then from (4.28) we obtain the inequality $0 < 2/\tau' \le M_1 - \ell_1 m_1$, or $\tau' \ge 2/(M_1 - \ell_1 m_1)$, i.e., (4.27).
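The bound of Lemma 4.1 can be probed on a finite-dimensional example. The sketch below assumes real symmetric matrices with $K = C = I$, so that $M_1$ is the largest eigenvalue of the sample $T$ (known in closed form for this tridiagonal matrix); the matrices, the sample vectors, and the names are hypothetical.

```python
from math import sqrt

def matvec(A, x):
    return [sum(a * xj for a, xj in zip(row, x)) for row in A]

def dot(x, y):
    return sum(a * b for a, b in zip(x, y))

def form(A, x, y):
    """(Ax, Ky) with K = I (assumption)."""
    return dot(matvec(A, x), y)

T = [[2.0, -1.0, 0.0], [-1.0, 2.0, -1.0], [0.0, -1.0, 2.0]]
S = [[1.0, 0.0, 0.0], [0.0, 2.0, 0.0], [0.0, 0.0, 3.0]]
M1 = 2.0 + sqrt(2.0)      # sup (Tu,u)/(Cu,u) for this T with C = I
t = 0.99 * 2.0 / M1       # a step inside the interval (4.24)

samples = [[1.0, 0.0, 0.0], [1.0, 2.0, 3.0], [-1.0, 1.0, 2.0]]
checks = []
for u in samples:
    lam = form(T, u, u) / form(S, u, u)
    h = [x - lam * y for x, y in zip(matvec(T, u), matvec(S, u))]  # C = I
    hh = dot(h, h)
    # as = 2/tau' written out as in (4.28)
    a_s = form(T, h, h) / hh - (form(S, h, h) / hh) * lam
    u_new = [ui - t * hi for ui, hi in zip(u, h)]
    delta = lam - form(T, u_new, u_new) / form(S, u_new, u_new)
    checks.append((a_s, delta))
```

Each pair in `checks` holds the quantity $as$ and the decrease $\Delta\lambda$; under the lemma one expects $as < M_1$ and $\Delta\lambda > 0$ for every sample.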

Remark 10. Let us remark that the additional conditions (4.25) and (4.26) will be satisfied, for example, when we deal with finite matrices or bounded operators in $H$. In this case Lemma 4.1 generalizes the corresponding results for the symmetric and positive definite matrices [6, 13].

Lemma 4.2. If at each step of the process (4.8)-(4.10) the sequence $\{t_i\}$, $t_i \ne 0$, is so chosen that

$$p_i = p(u_i,t_i) \ge \delta > 0, \qquad h_i \ne 0, \quad i = 0,1,2,\ldots, \tag{4.29}$$

then

(a) The sequence $\{\lambda^i\}$ converges to some real number $\lambda^*$.

(b) The series $\sum_{i=0}^{\infty} d_i$ converges.

(c) The sequence $\{u^i\} = \{u_i/|u_i|_1\}$ is bounded in the $H_2$-norm and

$$|C^{-1}(Tu^i - \lambda^i Su^i)|_2 \to 0, \qquad \text{as } i \to \infty. \tag{4.30}$$

Proof. Let $t_i \ne 0$ be so chosen that (4.29) is satisfied. Since $d_i > 0$, the definition (4.14) and formula (4.15) show that

$$\lambda^i - \lambda^{i+1} = d_i p_i \ge d_i\delta > 0. \tag{4.31}$$

Hence $\{\lambda^i\}$ is a monotonically decreasing sequence of positive numbers and, therefore, converges to some number $\lambda^*$.

(17) Since $d_i > 0$, it follows from (4.15) that for the validity of Lemma 4.2(a) it is sufficient to assume that $p_i > 0$ for $i = 0,1,2,\ldots$.

To prove Lemma 4.2(b) note that by the recurring relation (4.15)

$$\lambda^{i+1} = \lambda^i - d_i p_i = \lambda^{i-1} - d_{i-1}p_{i-1} - d_i p_i = \cdots = \lambda^0 - \sum_{j=0}^{i} d_j p_j \tag{4.32}$$

and consequently the convergence of $\{\lambda^i\}$ implies the convergence of the series $\sum_{j=0}^{\infty} d_j p_j$. In virtue of condition (4.29), this implies the validity of Lemma 4.2(b).
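The telescoping relation (4.32) and the decay of $d_i$ can be observed on a small example. The sketch assumes real symmetric $T$, $S = C = K = I$, and a fixed step $t$; with $C = S$ the coefficients of (4.17) reduce to $a = 1$, $b = 0$, $c = d$, which is what the formula for `p` below uses. The matrix, the starting vector, and the step are hypothetical.

```python
def matvec(A, x):
    return [sum(a * xj for a, xj in zip(row, x)) for row in A]

def dot(x, y):
    return sum(a * b for a, b in zip(x, y))

T = [[2.0, -1.0, 0.0], [-1.0, 2.0, -1.0], [0.0, -1.0, 2.0]]
u = [1.0, 0.0, 0.0]
t = 0.5
lams, ds, ps = [], [], []

for _ in range(100):
    uu = dot(u, u)
    lam = dot(matvec(T, u), u) / uu
    h = [x - lam * y for x, y in zip(matvec(T, u), u)]  # C = S = I, so h = r
    hh = dot(h, h)
    if hh == 0.0:          # u is already an eigenvector
        break
    d = hh / uu                                   # (4.16)
    s = dot(matvec(T, h), h) / hh - lam           # s = lambda(h) - lambda(u)
    p = (2.0 * t - s * t * t) / (1.0 + d * t * t) # (4.16) with a=1, b=0, c=d
    lams.append(lam)
    ds.append(d)
    ps.append(p)
    u = [ui - t * hi for ui, hi in zip(u, h)]     # (4.8)

lam_last = dot(matvec(T, u), u) / dot(u, u)
telescoped = lams[0] - sum(dj * pj for dj, pj in zip(ds, ps))  # (4.32)
```

Up to rounding, `telescoped` should reproduce `lam_last`, and the recorded values of `ds` should shrink toward zero as in part (c) of the lemma.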

Consider the sequence $\{u^i\}$. Clearly it is bounded in the $H_1$-norm since $|u^i|_1 = 1$, $i = 0,1,2,\ldots$. Moreover, by Lemma 4.2(a) and (4.3), $\{u^i\}$ is bounded in the $H_2$-metric for

$$|u^i|_2^2 = (Cu^i,Ku^i) = \frac{(Cu_i,Ku_i)}{(Su_i,Ku_i)} = \lambda^i\,\frac{(Cu^i,Ku^i)}{(Tu^i,Ku^i)} \le \frac{\lambda^0}{M_2}.$$

We see that in this case the upper bound of $\{u^i\}$ depends on the initial element $u_0$. To complete the proof note that (4.30) follows from (b). In fact, from (b) we derive

$$d_i = \frac{(Ch_i,Kh_i)}{(Su_i,Ku_i)} = |C^{-1}(Tu^i - \lambda^i Su^i)|_2^2 \to 0, \qquad \text{as } i \to \infty.$$


Let us remark that when $u_0$ is chosen arbitrarily the sequence $\{\lambda^i\}$ may converge to zero, as $i \to \infty$. However, since by (4.32) the limit $\lambda^* = 0$ if and only if

$$\lambda^0 = \sum_{i=0}^{\infty} d_i p_i, \tag{4.33}$$

it is rather unlikely that (4.33) may be satisfied in practice.


In order to replace $\{u^i\}$ by $\{u_i\}$ in the above lemma we need, in general, some additional hypothesis on $\{t_i\}$.
Lemma 4.3. If in addition to (4.29) we assume that for some $\delta_2 > 0$

$$t_i \le \delta_2, \qquad i = 0,1,2,\ldots, \tag{4.34}$$

then

(a) There exists a constant $c > 0$ such that $\lim (Cu_i,Ku_i) = \lim |u_i|_2^2 = c$;

(b) $0 < |u_0|_2 \le |u_1|_2 \le \cdots \le |u_i|_2 \le \cdots \le c^{1/2}$;

(c) $|C^{-1}(Tu_i - \lambda^i Su_i)|_2 \to 0$, as $i \to \infty$.
'  1  1  '  2 

Proof. Using (4.12) recursively we obtain the relation

$$|u_{i+1}|_2^2 = |u_0|_2^2 \prod_{j=0}^{i} (1 + t_j^2 g_j), \tag{4.35$'$}$$

where $g_j = |h_j|_2^2/|u_j|_2^2$. It follows from it that $\{|u_{i+1}|_2^2\}$ forms a monotonically increasing sequence which is bounded below by $|u_0|_2^2$. It is known that the product $\prod_{j=0}^{i}(1 + t_j^2 g_j)$ converges, as $i \to \infty$, if and only if the series $\sum_{j=0}^{\infty} t_j^2 g_j$ converges. In view of (4.34), the series $\sum_{j=0}^{\infty} t_j^2 g_j$ converges if $\sum_{j=0}^{\infty} g_j$ converges. Since, by (4.3), Lemma 4.2(a), and the assumption $\lambda^* > 0$,

$$g_j = \frac{d_j}{\lambda^j}\,\frac{(Tu_j,Ku_j)}{(Cu_j,Ku_j)} \le d_j\,\frac{M_1}{\lambda^*}, \qquad j = 0,1,2,\ldots, \tag{4.35}$$

the convergence of $\sum_{j=0}^{\infty} g_j$ follows from (4.35) and Lemma 4.2(b). Thus, there exists a constant $c > 0$ such that $|u_0|_2^2\prod_{j=0}^{\infty}(1 + t_j^2 g_j) = \lim_i |u_{i+1}|_2^2 = c$. Evidently, $c$ is the least upper bound of the monotonically increasing sequence $\{|u_{i+1}|_2^2\}$. This implies Lemma 4.3(b). The assertion (c) follows immediately from (b) and the fact that $\lim g_i = 0$. Indeed, by Lemma 4.3(b),

$$\lim_i |C^{-1}(Tu_i - \lambda^i Su_i)|_2 = \lim_i |h_i|_2 = \lim_i \{g_i^{1/2}|u_i|_2\} \le c^{1/2}\lim_i g_i^{1/2} = 0.$$

This completes the proof of Lemma 4.3.

Suppose there exists a constant $\ell_2 > 0$ such that

$$(Cu,Ku) \ge \ell_2\,(Su,Ku), \qquad u \in D_T. \tag{4.36}$$

The inequalities (4.3) and (4.36) imply that the operators $C^{-1}T$ and $C^{-1}S$, considered on $D_T$, are bounded in the $H_2$-norm, where $C^{-1}$ is understood in the extended sense. Let $W_0$ and $N_0$ denote the unique extensions of $C^{-1}T$ and $C^{-1}S$ to all of $H_2$, respectively.

Theorem 4.1. Let $T$, $S$, $K$, and $C$ satisfy conditions (4.3) and (4.36) and let $\{t_i\}$ be such that (4.29) and (4.34) are fulfilled. Then for any $u_0 \ne 0$ in $D_T$ the sequence $\{\lambda^i\}$, where $\{u_i\}$ is determined by (4.8)-(4.10), converges to a number $\lambda^* \ge M_2\ell_2 > 0$ and $|C^{-1}(Tu_i - \lambda^i Su_i)|_2 \to 0$, as $i \to \infty$. If the sequence $\{u_i\}$ does not converge weakly to zero (in $H_2$), then there is $u^* \ne 0$ in $H_2$ so that

$$W_0u^* - \lambda^* N_0u^* = 0. \tag{4.37}$$

If, in addition, $D_S \supseteq H_2$, then $\lambda^*$ is an eigenvalue of (4.1) and $u^*$ its eigenvector.

Proof. The first part of Theorem 4.1 follows from Lemma 4.3 and (4.36) since, by (4.36) and (4.3), $\lambda^i = \dfrac{(Tu_i,Ku_i)}{(Su_i,Ku_i)} \ge M_2\ell_2 > 0$. To prove the second part note that since by Lemma 4.3(b) the sequence $\{u_i\}$ is bounded (in $H_2$) we can extract from it a weakly convergent subsequence $\{u_k\}$ (in $H_2$). By hypothesis, the weak limit $u^*$ (in $H_2$) of $\{u_k\}$ is not zero. Hence, in view of the boundedness of $C^{-1}T$ and $C^{-1}S$ (in $H_2$), the sequence $\{z_k\} = \{C^{-1}Tu_k - \lambda^k C^{-1}Su_k\}$ converges weakly to $W_0u^* - \lambda^* N_0u^*$ (in $H_2$). But, by the first part of Theorem 4.1, $\{z_k\}$ converges also strongly to zero (in $H_2$). Hence the equation (4.37) is satisfied.




To show the last assertion of our theorem observe that, in virtue of (4.3), $T$ satisfies all the conditions of Theorem 1.3. Hence it has a K-p.d. s.g. F.e. $T_0$ given by $T_0 = CW_0$. Furthermore, the condition $D_S \supseteq H_2$ implies that $N_0 = C^{-1}S$ and that (4.37) can be written in the form $W_0u^* = \lambda^* C^{-1}Su^*$. Applying $C$ to both sides of this equation and using the fact that $CW_0 = T_0$ we obtain the equality

$$T_0u^* - \lambda^* Su^* = 0,$$

i.e., $\lambda^*$ is an eigenvalue of (4.1) and $u^*$ is its corresponding eigenvector.

Remark 11. Let us remark that so far no use has been made of the freedom in choosing the operator $K$. Thus, if $K$ is chosen to be some closed operator with $D_K = D_T$, then by Theorem 1.2 and the inequality (4.3) the operators $T$ and $C$ are closed and continuously invertible, and the inequality (1.12) is valid for them. Hence, in this case, $D[T] = H_0 = H_2$ so that the condition $D_S \supseteq H_2$ is always satisfied. Of course, other choices of $K$ are possible so that $D_S \supseteq D_K \supseteq H_2$. However, we will not dwell on it now.

Let us remark that if $C^{-1}S$ is also compact in $H_2$, then $|C^{-1}Su_k - C^{-1}Su^*|_2 \to 0$ and $|T^{-1}Cz_k|_2 \to 0$, as $k \to \infty$. Consequently, the boundedness of $T^{-1}C$ and $T^{-1}S$ (in $H_2$) and Lemma 4.3 imply that

$$|u_k - u^*|_2 \le \cdots + \lambda^*\,|T^{-1}C|\,|C^{-1}Su_k - C^{-1}Su^*|_2 \to 0 \tag{4.38}$$

as $k \to \infty$; i.e., in this case $\{\lambda^k\}$ converges to an eigenvalue $\lambda^*$ of (4.1) and $\{u_k\}$ converges strongly (in $H_2$) to the eigenvector $u^*$ corresponding to $\lambda^*$.

At the end of this section let us point out that in general we cannot assert that $\lambda^*$ is equal to the smallest eigenvalue $\lambda_1 \left(= \inf_{u \in D_T} \dfrac{(Tu,Ku)}{(Su,Ku)}\right)$ of (4.1). However, if we assume that $\lambda_1$ is an isolated eigenvalue of (4.1) of finite multiplicity then for some special choices of the operator $C$ with some additional condition on $\{t_i\}$ we shall show that $\lambda^* = \lambda_1$ and that $\{u_i\}$ converges to some vector in the eigenspace belonging to $\lambda_1$.

4.4. Two simple choices of $C$. In this section we show that if $C$ is taken to be either $T$ or $S$, then the assertions of the last paragraph indeed take place.

Let us denote by $H^1$ the space of eigenvectors belonging to $\lambda_1$ and by ${}^{\perp}H^1$ the orthogonal complement of $H^1$ in $H_0$. Assume that $u_0 \notin {}^{\perp}H^1$ and that $\{t_i\}$ is bounded from below by some constant $\delta_1 > 0$; i.e.,

$$u_0 = \xi_0 w + v_0, \qquad w \in H^1, \quad |w| = 1, \quad \xi_0 > 0, \quad v_0 \in D_T \cap {}^{\perp}H^1, \tag{4.39}$$

and

$$t_i \ge \delta_1 > 0, \qquad i = 0,1,2,\ldots. \tag{4.40}$$

Let us define the closeness of two elements $u$ and $z$ in $H_2$ by

$$\sin^2[u;z]_2 = 1 - \frac{|[u,z]_2|^2}{|u|_2^2\,|z|_2^2}. \tag{4.41}$$

It is obvious that $\sin^2[u;z]_2 = 1$ when $u$ is orthogonal to $z$ (in $H_2$) and that $\sin^2[u;z]_2 = 0$ when $u$ and $z$ have the same direction. Furthermore, it is not hard to show that when $u = \xi w + v$, where $|w|_2 = 1$ and $[w,v]_2 = 0$, then

$$\sin^2[u;w]_2 = \frac{|v|_2^2}{|u|_2^2}. \tag{4.42}$$

Accordingly we shall say that a sequence of elements $\{u_i\}$ converges "in direction" to an element $w$ in the $H_2$-metric if

$$\lim_{i \to \infty} \sin^2[u_i;w]_2 = \lim_{i \to \infty}\left(1 - \frac{|[u_i,w]_2|^2}{|u_i|_2^2\,|w|_2^2}\right) = 0. \tag{4.43}$$
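The measure of closeness (4.41) and the relation (4.42) can be illustrated in finite dimensions. The sketch assumes $K = I$ and a real symmetric positive definite matrix $C$, so that $[u,v]_2 = (Cu,v)$; the matrix, the vectors, and the function names are hypothetical.

```python
from math import sqrt

def matvec(A, x):
    return [sum(a * xj for a, xj in zip(row, x)) for row in A]

def inner2(C, u, v):
    """[u, v]_2 = (Cu, Kv) with K = I (assumption)."""
    return sum(a * b for a, b in zip(matvec(C, u), v))

def sin2(C, u, z):
    """sin^2[u; z]_2 as defined in (4.41)."""
    return 1.0 - inner2(C, u, z) ** 2 / (inner2(C, u, u) * inner2(C, z, z))

C = [[2.0, 0.0], [0.0, 1.0]]
w = [1.0 / sqrt(2.0), 0.0]   # |w|_2 = 1 in the C-metric
v = [0.0, 3.0]               # [w, v]_2 = 0
xi = 2.0
u = [xi * wi + vi for wi, vi in zip(w, v)]

lhs = sin2(C, u, w)
rhs = inner2(C, v, v) / inner2(C, u, u)   # right-hand side of (4.42)
```

For this decomposition $|v|_2^2 = 9$ and $|u|_2^2 = \xi^2 + 9 = 13$, so both `lhs` and `rhs` should equal $9/13$ up to rounding.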
Lemma 4.4. If $C$ is either $T$ or $S$ and the initial approximation $u_0$ is of the form (4.39), then the sequence $\{u_n\}$, $n = 0,1,2,\ldots$, determined by (4.8)-(4.10) is of the form

$$u_n = \xi_n w + v_n, \qquad n = 0,1,2,\ldots, \tag{4.44}$$

where $w \in H^1$, $|w| = 1$, $\xi_n > 0$, and $v_n \in D_T \cap {}^{\perp}H^1$.
o      ^     '  n  n  T  o 

Proof. In view of mathematical induction and our assumption (4.39) it is sufficient to prove (4.44) for $n = i + 1$, assuming its validity for $n = i$, i.e., assume that $u_i = \xi_i w + v_i$, where $|w| = 1$, $w \in H^1$, $\xi_i > 0$, and $v_i \in D_T \cap {}^{\perp}H^1$. Using this assumption and the formulas (4.8)-(4.10) we obtain in a straightforward manner that when $C = T$, then

$$u_{i+1} = u_i - t_i h_i = \xi_i\Bigl[1 + t_i\Bigl(\frac{\lambda^i}{\lambda_1} - 1\Bigr)\Bigr]w + (1 - t_i)v_i + t_i\lambda^i T^{-1}Sv_i = \xi_{i+1}w + v_{i+1}, \tag{4.45}$$

where $\xi_{i+1} = \xi_i\bigl[1 + t_i\bigl(\frac{\lambda^i}{\lambda_1} - 1\bigr)\bigr] > 0$ and $v_{i+1} = (1 - t_i)v_i + t_i\lambda^i T^{-1}Sv_i$, and when $C = S$, then

$$u_{i+1} = u_i - t_i h_i = \xi_i\bigl[1 + t_i(\lambda^i - \lambda_1)\bigr]w + (1 + t_i\lambda^i)v_i - t_i S^{-1}Tv_i = \tilde\xi_{i+1}w + \tilde v_{i+1}, \tag{4.46}$$

where $\tilde\xi_{i+1} = \xi_i[1 + t_i(\lambda^i - \lambda_1)] > 0$ and $\tilde v_{i+1} = (1 + t_i\lambda^i)v_i - t_i S^{-1}Tv_i$. It follows from (4.45) and (4.46) that Lemma 4.4 will be proved if we show that $[v_{i+1},u] = [\tilde v_{i+1},u]_1 = 0$ for any $u \in H^1$. But this is obvious since $v_i \in {}^{\perp}H^1$ and hence $[v_{i+1},u] = (1 - t_i)[v_i,u] + t_i\lambda^i(Sv_i,Ku) = 0$ and $[\tilde v_{i+1},u] = \lambda_1[\tilde v_{i+1},u]_1 = \lambda_1\{(1 + t_i\lambda^i)[v_i,u]_1 - t_i(Tv_i,Ku)\} = 0$.

Theorem 4.2. If $C = T$, $\lambda_1$ is an isolated eigenvalue of (4.1) of finite multiplicity, $\{t_i\}$ satisfies conditions (4.29), (4.34), and (4.40), $u_0$ is of the form (4.39), and $\{u_i\}$ is determined by (4.8)-(4.10), then

(a) $\{\lambda^i\}$ converges to $\lambda^* = \lambda_1$.

(b) $\{u_i\}$ converges in direction to $w$ in the $H_2$-metric with the error estimate

$$|u_i - \xi_i w|_1 \le \left(\frac{\lambda^i - \lambda_1}{\lambda_2 - \lambda_1}\right)^{1/2} |u_i|_1, \qquad \xi_i \ne 0. \tag{4.47}$$

Proof. To prove Theorem 4.2(a) we use Lemma 4.4 and (4.45) to get

$$(Tu_{i+1},Kw) = \Bigl[1 + t_i\Bigl(\frac{\lambda^i}{\lambda_1} - 1\Bigr)\Bigr](T\xi_i w,Kw) = \Bigl[1 + t_i\Bigl(\frac{\lambda^i}{\lambda_1} - 1\Bigr)\Bigr](Tu_i,Kw).$$

Using the above relation recursively we obtain the formula

$$(Tu_{i+1},Kw) = \prod_{j=0}^{i}(1 + \zeta_j)\,(Tu_0,Kw), \tag{4.48}$$

where $\zeta_j = t_j(\lambda^j/\lambda_1 - 1) \ge 0$ since $\lambda^j \ge \lambda_1$ for all $j$ and $(Tu_0,Kw) > 0$. Formula (4.48) shows that $\{(Tu_{i+1},Kw)\}$ is a monotonically increasing sequence of positive numbers. Since $(Tu_{i+1},Kw) \le |u_{i+1}|_2$, Lemma 4.3(b) implies that the sequence $\{(Tu_{i+1},Kw)\}$ is bounded. Consequently, the product $\prod_{j=0}^{i}(1 + \zeta_j)$ converges, as $i \to \infty$. But, for $\zeta_j \ge 0$, the convergence of $\prod_{j=0}^{\infty}(1 + \zeta_j)$ is equivalent to the convergence of the series $\sum_{j=0}^{\infty}\zeta_j$. The convergence of the latter implies that $\zeta_j \to 0$, as $j \to \infty$. Since $\lambda^j \ge \lambda_1$ and $t_j \ge \delta_1 > 0$ we get

$$0 \le \delta_1\Bigl(\frac{\lambda^j}{\lambda_1} - 1\Bigr) \le \zeta_j \to 0,$$

showing that $\lambda^j \to \lambda_1$, as $j \to \infty$.

The proof of Theorem 4.2(b) follows from (4.44) and the fact that

$$(\lambda^i - \lambda_1)(Su_i,Ku_i) = (Tu_i,Ku_i) - \lambda_1(Su_i,Ku_i) = (Tv_i,Kv_i) - \lambda_1(Sv_i,Kv_i). \tag{4.49}$$

Since $\lambda_1$ is an isolated eigenvalue of (4.1) and, by definition, the next eigenvalue $\lambda_2 = \inf \dfrac{(Tu,Ku)}{(Su,Ku)}$, $u \in D_T \cap {}^{\perp}H^1$, and $v_i \in D_T \cap {}^{\perp}H^1$ we obtain from (4.49) the inequality $(\lambda^i - \lambda_1)(Su_i,Ku_i) \ge (\lambda_2 - \lambda_1)(Sv_i,Kv_i)$; i.e.,

$$|v_i|_1^2 \le \frac{\lambda^i - \lambda_1}{\lambda_2 - \lambda_1}\,|u_i|_1^2. \tag{4.50}$$

To obtain the estimate in the $H_2$-metric note that $\dfrac{1}{\lambda_2}|v_i|_2^2 \ge |v_i|_1^2$, $-\dfrac{\lambda_1}{\lambda_2}|v_i|_2^2 \le -\lambda_1|v_i|_1^2$, and $|u_i|_1^2 \le \dfrac{1}{\lambda_1}|u_i|_2^2$. This and (4.49) show that

$$\frac{\lambda^i - \lambda_1}{\lambda_1}\,|u_i|_2^2 \ge (\lambda^i - \lambda_1)|u_i|_1^2 = (Tv_i,Kv_i) - \lambda_1(Sv_i,Kv_i) \ge \frac{\lambda_2 - \lambda_1}{\lambda_2}\,|v_i|_2^2. \tag{4.51}$$

In virtue of (4.51) and relation (4.42), Theorem 4.2(a) shows that

$$\lim_{i \to \infty} \sin^2[u_i;w]_2 = \lim_{i \to \infty} \frac{|v_i|_2^2}{|u_i|_2^2} \le \lim_{i \to \infty}\left(\frac{\lambda_2}{\lambda_1}\cdot\frac{\lambda^i - \lambda_1}{\lambda_2 - \lambda_1}\right) = 0;$$

i.e., $\{u_i\}$ converges in direction to $w$. The error estimate (4.47) follows also from (4.51). The proof of Theorem 4.2 is thus complete.

We see from the proof of our theorem that if the space $H^1$ is not one-dimensional then the limit of the sequence $\{u_i\}$ depends on the choice of $u_0$.

Let us now consider an important case when $C = S$ with $D_S = D_C = D_T$. This will happen for example when we deal with finite matrices or bounded operators in $H$. Let us first summarize in Lemma 4.5 below the following properties which are valid in this case.

Lemma 4.5. If $C = S$ and all conditions satisfied by $C$ are now valid for $S$, then

(a) $(Sh,Ku) = 0$, where $u$ and $h$ are determined by (4.8)-(4.10);

(b) $a = 1$, $b = 0$, $c = d$, $p(u,t) = \dfrac{2t - st^2}{1 + ct^2}$;

(c) $\dfrac{dp}{dt} = \dfrac{2(1 - st - dt^2)}{(1 + dt^2)^2} = 0$ for $\tilde\theta = \dfrac{2}{s - \sqrt{s^2 + 4d}} < 0$ and $\theta = \dfrac{2}{s + \sqrt{s^2 + 4d}} > 0$;

(d) $\min_t p = p(u,\tilde\theta) = \tilde\theta < 0$ and $\max_t p = p(u,\theta) = \theta > 0$;

(e) $\lambda(h) - \lambda(u - \theta h) = \lambda(h) - \lambda(u) + \lambda(u) - \lambda(u - \theta h) = s + d\theta = 1/\theta$;

(f) The graph of $p$ is determined by Fig. 1, Fig. 2j, and Fig. 3j;

(g) Lemma 4.1 remains valid for $t$ in the interval: $0 < t < 2/(M_1 - m_1)$.
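Parts (c)-(e) of Lemma 4.5 suggest taking, at each step, the positive stationary point $\theta$ as the step length. The following sketch tries this for a small real symmetric $T$ with $S = C = K = I$; it is only an illustration under these assumptions, with hypothetical names, sample values, and a hypothetical stopping tolerance.

```python
from math import sqrt

def matvec(A, x):
    return [sum(a * xj for a, xj in zip(row, x)) for row in A]

def dot(x, y):
    return sum(a * b for a, b in zip(x, y))

def p(t, s, d):
    """p(u, t) of Lemma 4.5(b), i.e. with a = 1, b = 0, c = d."""
    return (2.0 * t - s * t * t) / (1.0 + d * t * t)

def optimal_step(s, d):
    """Positive root theta of d t^2 + s t - 1 = 0, Lemma 4.5(c)."""
    return 2.0 / (s + sqrt(s * s + 4.0 * d))

# Check (d) and (e) on hypothetical values of s and d.
s0, d0 = 0.7, 0.3
theta0 = optimal_step(s0, d0)

# Iteration (4.8)-(4.10) with C = S = I and t_i = theta at every step.
T = [[2.0, -1.0, 0.0], [-1.0, 2.0, -1.0], [0.0, -1.0, 2.0]]
u = [1.0, 0.0, 0.0]
lams = []
for _ in range(100):
    uu = dot(u, u)
    lam = dot(matvec(T, u), u) / uu
    lams.append(lam)
    h = [x - lam * y for x, y in zip(matvec(T, u), u)]
    hh = dot(h, h)
    if hh <= 1e-24 * uu:   # residual negligible: stop (hypothetical tolerance)
        break
    d = hh / uu
    s = dot(matvec(T, h), h) / hh - lam
    theta = optimal_step(s, d)
    u = [ui - theta * hi for ui, hi in zip(u, h)]
lam_min = 2.0 - sqrt(2.0)  # smallest eigenvalue of this particular T
```

The first check reflects $p(u,\theta) = \theta$ and $s + d\theta = 1/\theta$; the loop is expected to drive $\lambda^i$ monotonically down to the smallest eigenvalue of the sample matrix, since each step decreases $\lambda$ by $d\,\theta > 0$.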

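Theorem 4.3. If $C = S$ and all other conditions of Theorem 4.2 are satisfied, then

(a) $\{\lambda^i\}$ converges to $\lambda^* = \lambda_1$.

(b) $\{u_i\}$ converges in direction to $w$ in the $H_2$- and $H_0$-metric with the error estimate

$$|u_i - \xi_i w|_1 \le \left(\frac{\lambda^i - \lambda_1}{\lambda_2 - \lambda_1}\right)^{1/2} |u_i|_1. \tag{4.52}$$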

Proof. Let us first note that $(Su_0, Kw) = \frac{1}{\lambda_1}(Tu_0, Kw) > 0$. Furthermore, in virtue of Lemma 4.4 and relation (4.46), we easily obtain the formula

$$(Su_{i+1}, Kw) = \prod_{j=0}^{i} (1 + \xi_j)\,(Su_0, Kw), \tag{4.53}$$

where $\xi_j = t_j(\lambda^j - \lambda_1) \ge 0$. This implies that $\{(Su_{i+1}, Kw)\}$ forms a monotonically increasing sequence of positive numbers which, in view of Lemma 4.3(b), is bounded. Repeating the argument in the proof of Theorem 4.2(a) we derive the validity of Theorem 4.3(a).

To prove Theorem 4.3(b) we use the same argument as in the proof of
Theorem 4.2(b) to get the inequality
KK     ^■-^-


'    i'  1 


from which we conclude that $\{u_i\}$ converges in direction to $w$ in the
H^-metric. Using the property of $\lambda$ and the inequality (4.3) for $C = S$
we easily show that


2  Nil^       ^i  ^'-\ 

sin    [u.,w]    = <     -r—  •  r r-    -   0    ,  as    i  —  «    ; 

1  I       1 2      —       A.,        A„  -  A, 

ju.J  1         2         1 

i.e., $\{u_i\}$ converges to $w$ also in direction in the H -metric.

4.5. Construction of sequences $\{t_i\}$ and special cases. The general method described in the previous sections is not completely determined until the choice of the sequence $\{t_i\}$ and of the operators $K$ and $C$ has been made. In this section we shall describe a few methods of constructing the sequence $\{t_i\}$ so that the conditions (4.29) and (4.34) are satisfied. Then by specializing the operators $K$, $C$, and $S$ we obtain, in particular, the most important iterative methods used in the approximate calculation of eigenvalues and eigenvectors of symmetric matrices and bounded operators as special cases. This will generalize at the same time these particular schemes to the solution of eigenvalue problems with unbounded operators. In what follows we shall always assume that conditions (4.3) and (4.36) are satisfied.

(i) The method of a constant factor. If $t_i = t$, where $t$ is a constant, then the scheme (4.8) reduces to the method of a constant factor

$$u_{i+1} = u_i - t h_i, \qquad i = 0, 1, 2, \ldots, \tag{$4.8_1$}$$

where $h_i$ is the solution of (4.10) and $u_0$ is an initial approximation in $D$. We see from Lemma 4.1 that for $\{\lambda^i\}$ to form a decreasing sequence it is necessary to limit $t$ to the interval (4.24) or, if additional conditions (4.25) and (4.26) are fulfilled, to the interval (4.27). To show that such a $t$ satisfies conditions (4.29)
and (4.34) let us put $t$ in the form $t = \alpha/M$, $0 < \alpha < 2$. Then, by (4.28), $a_i s_i t \le \alpha$ or $2 - a_i s_i t \ge 2 - \alpha > 0$. Furthermore,

$$1 - 2b_i t + c_i t^2 = \frac{|u_i - t h_i|_1^2}{|u_i|_1^2} \le \Bigl(1 + t\,\frac{|h_i|_1}{|u_i|_1}\Bigr)^2 = \bigl(1 + t c_i^{1/2}\bigr)^2$$

and, therefore,

$$p(u_i, t) = \frac{t(2 - a_i s_i t)}{1 - 2b_i t + c_i t^2} \ge \frac{t(2 - \alpha)}{(1 + t c_i^{1/2})^2}. \tag{4.54}$$

Thus, to verify the inequality (4.29) it is sufficient to estimate the bound of $c_i$ independent of $i$. Using (4.36) and the fact that for $t = \alpha/M$, $0 < \alpha < 2$, $\{\lambda^i\}$ is a monotonically decreasing sequence we obtain

(Sh.,Kh.)  (Sh.,Kh.)      (Ch.,Kh.)       .  .o    (Ch.,Kh.) 

(4   55)  c      =    - -     =    - - - -    X^    <     - - - 

^  '  1  (Su.,Ku.)  {Ch.,Kh.)      (Tu.,Ku.)  -     l^    (Tu.,Ku.)      * 

11  11  11  2ii 

On  the  other  hand,   in  virtue  of  (4.  3)  and  (4.  36),   the  operator    T     S    defined  on 
D       is  bounded  in  H   -norm.     Hence,   using  (4.  10)  and  (4.  3),   we  get 

\h.\l    =    (Tu.,Kh.)    -    x\su.,Kh.)    <    (1  +  X°|  T°-^s|)|u  I  I  hJ 
'    i'2  11  1        1      —  '  J    I    jj  J    jj 

(4.56) 


1/2,,       xO|_-l 


<     Mj'^(l-hX°|T-^sl)|uJ|hJ 


2 


Consequently,  by  (4.55)  and  (4.  56)j 


(4.57)       (1+tc^^^)^    <     [l+t(-y-^)^^^(l+X°lT"^s|)]^   =    [1+tM]^   =    M     , 
^  2 

where $\tilde M$ denotes the upper bound of $c_i^{1/2}$. This and (4.54) imply the validity of (4.29) with $\delta = t(2 - \alpha)/(1 + t\tilde M)^2$. The inequality (4.34) is satisfied trivially. Let us add that the inequality (4.40) is, of course, also satisfied.

Thus, for the method of a constant factor $t$ all the results derived in the previous sections remain valid.
Special case. If $K = S = C = I$ and $H$ is a finite dimensional space, then $T$ is a selfadjoint and positive definite matrix. In this case the scheme (4.9 ) reduces to an analogous scheme for such an eigenvalue problem [6, 37]. However, even in this simple case our results are more general. If $C = I$ and $K$ is a symmetric and positive definite matrix (or bounded operator in $H$), then $T$ and $S$ are symmetrizable matrices (or operators) and the scheme (4.9 ) is valid in this case [11]. Also, many other choices of the operators $C$ and $K$ are possible.
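
For the matrix special case just described, the method of a constant factor can be sketched as follows. This is only an illustration under stated assumptions: $K = S = C = I$, $T$ is a small symmetric positive definite matrix chosen here for the example, and the constant $M$ is taken to be a Gershgorin-type upper bound for the spectrum of $T$, which stands in for the constant of (4.28) and is not taken from the text. The function names are hypothetical.

```python
import math

def matvec(A, x):
    """Product of a square matrix (list of rows) with a vector."""
    return [sum(a * xj for a, xj in zip(row, x)) for row in A]

def dot(x, y):
    return sum(a * b for a, b in zip(x, y))

def rayleigh(T, x):
    """lambda(x) = (Tx, x)/(x, x) for K = S = I."""
    return dot(matvec(T, x), x) / dot(x, x)

def constant_factor_iteration(T, u0, t, steps):
    """Iterate u_{i+1} = u_i - t h_i with h_i = T u_i - lambda(u_i) u_i.

    Returns the last iterate and the list of values lambda(u_i), i = 0..steps.
    """
    u = list(u0)
    history = [rayleigh(T, u)]
    for _ in range(steps):
        lam = history[-1]
        h = [tu - lam * ui for tu, ui in zip(matvec(T, u), u)]
        u = [ui - t * hi for ui, hi in zip(u, h)]
        history.append(rayleigh(T, u))
    return u, history

# Illustrative data (assumptions): eigenvalues of T are 2 - sqrt(2), 2, 2 + sqrt(2).
T = [[2.0, -1.0, 0.0],
     [-1.0, 2.0, -1.0],
     [0.0, -1.0, 2.0]]
M = 4.0                 # assumed upper bound for the spectrum (largest absolute row sum)
alpha = 1.0             # any value with 0 < alpha < 2
u_final, lam_history = constant_factor_iteration(T, [1.0, 0.0, 0.0], alpha / M, 200)
```

With these choices `lam_history` is nonincreasing and its last entry approximates the smallest eigenvalue of `T`, while `u_final` approximates the corresponding eigenvector only in direction, since the iterates are not normalized.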

(ii) The method with relative minimal norms.(18) If $t_i = 1/e_i = (Ch_i, Kh_i)/(Th_i, Kh_i)$, then (4.8) reduces to the procedure

$$u_{i+1} = u_i - e_i^{-1} h_i, \qquad i = 0, 1, 2, \ldots, \tag{$4.8_2$}$$

with $h_i$ determined by (4.10). This choice of $t_i$ is suggested by the form of the function $p(u,t)$ in (4.15) since one of the basic requirements on $t_i$ is that condition (4.29) be not only valid but also easily verifiable. As will be seen below this is the case when $t_i = e_i^{-1}$. Let us also observe that when the operators $C$, $K$, and $S$ have also the property that $(Su_i, Kh_i) = 0$, $i = 0, 1, 2, \ldots$, then the method ($4.8_2$) has the following geometrical meaning: The choice $t_i = e_i^{-1}$ implies that the vector $t_i h_i$ is the orthogonal projection in the sense of the $H_0$-metric of the iterant $u_i$ on $h_i$. Consequently $u_{i+1}$ is obtained from $u_i$ by subtracting from it the projection of $u_i$ on $h_i$ so that the norm $|u_{i+1}|$ increases at each step of this process in such a way that its magnitude is minimum. For that reason we call it the method with relative minimal norms. To verify our conditions (4.29) and (4.34) note that by a simple calculation

(18) In case $C = K = S = I$ and $T$ is a bounded, selfadjoint, and positive definite operator in the real space $H$ the method ($4.8_2$) was suggested by Krasnoselsky [20] and investigated by Pugachev [29, 30] and Besmertnykh [2]; see also Altman [1].
1        Mh  )                            e.
(4.29   )      p(u     e^)    =   i > > ^   6       >     0     . 

1111  1111  1 

where  we  have  used  the  fact  that  by  (4.  3) 

and that, in virtue of the discussion in (i),

(1  -  2b.e,+  c.e.^)     <     (1+e.c^^^)^     <     [1  +  ^  •  uf   =     M°     . 
1111         —  11  —  M 

Thus,   we  see  that  at  the  same  time  we  verified  conditions  (4.  34)  and  (4.  40)  so 
that  for  this  method  all  the  results  derived  in  Sections  4.  3  and  4.  4  are  valid. 

Special cases. If we choose $C = T$, then $t_i = e_i^{-1} = 1$ for each $i$ and the method ($4.8_2$) is essentially equivalent to the method

$$Tu_{i+1} = \lambda^i S u_i, \qquad \lambda^i = \frac{(Tu_i, Ku_i)}{(Su_i, Ku_i)}, \qquad i = 0, 1, 2, \ldots, \tag{4.8'}$$

which could be regarded as a generalization of the Birger-Kalomy scheme [22] for the unbounded eigenvalue problem of the form (4.1). In particular, if $K = I$, then $T$ and $S$ are Hermitian positive definite operators and the procedure ($4.8_2$) reduces to the method used in [ ]. It is obvious that by choosing the operators $K$, $T$, and $S$ appropriately we obtain all the iterative schemes of the form (4.8') considered in [22]. Hence, if $u_0$ satisfies condition (4.39), Theorem 4.2 gives the convergence of these procedures and at the same time the error estimate.

Another  suitable  and  important  special  case  of  (4,8   )  is  obtained  when 

we  choose    C  =  S.      In  that  case    D     =  D     =  D    ,     (Su.,Kh.)  -  0    for  each    i,     and, 

in view of (4.3) and (4.25), the choice $t_i = (Sh_i, Kh_i)/(Th_i, Kh_i) = 1/\lambda(h_i)$ has the property that $0 < 1/M \le t_i \le 1/m$. Thus, for this case Theorem 4.3 is valid provided $u_0$ is of the form (4.39). At the end let us remark that the choice of $K$ is still at our disposal.
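
The choice $t_i = 1/\lambda(h_i)$ for $C = S$ can be illustrated in the matrix case. The sketch below assumes $K = C = S = I$ and a symmetric positive definite $T$; the test matrix, the stopping tolerance, and the function names are assumptions made for the example. It also records the step lengths so that the bounds $1/M \le t_i \le 1/m$ can be inspected, with $m$ and $M$ read here as the extreme eigenvalues of $T$.

```python
import math

def matvec(A, x):
    """Product of a square matrix (list of rows) with a vector."""
    return [sum(a * xj for a, xj in zip(row, x)) for row in A]

def dot(x, y):
    return sum(a * b for a, b in zip(x, y))

def rayleigh(T, x):
    """lambda(x) = (Tx, x)/(x, x) for K = S = I."""
    return dot(matvec(T, x), x) / dot(x, x)

def minimal_norm_iteration(T, u0, steps, rel_tol=1e-28):
    """Iterate u_{i+1} = u_i - h_i / lambda(h_i), h_i = T u_i - lambda(u_i) u_i.

    Stops early when (h, h) <= rel_tol * (u, u), so that lambda(h) is never
    evaluated at h = 0. Returns the last iterate, the values lambda(u_i),
    and the step lengths t_i that were used.
    """
    u = list(u0)
    history = [rayleigh(T, u)]
    step_lengths = []
    for _ in range(steps):
        lam = history[-1]
        h = [tu - lam * ui for tu, ui in zip(matvec(T, u), u)]
        hh = dot(h, h)
        if hh <= rel_tol * dot(u, u):
            break
        t = hh / dot(matvec(T, h), h)      # t_i = (h, h)/(Th, h) = 1/lambda(h_i)
        step_lengths.append(t)
        u = [ui - t * hi for ui, hi in zip(u, h)]
        history.append(rayleigh(T, u))
    return u, history, step_lengths

# Illustrative data (assumptions): eigenvalues of T are 2 - sqrt(2), 2, 2 + sqrt(2).
T = [[2.0, -1.0, 0.0],
     [-1.0, 2.0, -1.0],
     [0.0, -1.0, 2.0]]
u_final, lam_history, step_lengths = minimal_norm_iteration(T, [1.0, 0.0, 0.0], 200)
```

No bound on the spectrum has to be supplied in advance here, which is the practical difference from the method of a constant factor: the step length is computed from $h_i$ itself.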

(iii) Acceleration of the new method. We introduce a parameter $\alpha > 0$ in the process ($4.8_2$) so as to get the scheme

$$u_{i+1} = u_i - \alpha e_i^{-1} h_i, \qquad i = 0, 1, 2, \ldots. \tag{$4.8_3$}$$

Again  a  simple  calculation  and  the  result  from  (ii)  show  that  for  6*  =  \M  (1+  — M)  1 

2      '     1         M  ■■ 

i 

2ae    -  or    e.+a    e   '  r——r  ~    ,„ 

111    X(h.)  e.a(2  -  a) 

p{u.,ae.)    ^ — > 


"i  1  ^       0  0 

1  -  2b  ae^.  +  c.ar    e^  1  -  2b.Qre'.  +  c.(ae'.) 

'     '        '        '  1111  (4   29J 

>      6\(2  -  a)      . 

Hence conditions (4.29) and (4.34) are satisfied for any $\alpha$ in the interval

$$0 < \alpha < 2. \tag{4.58}$$

However, if additionally we know that there exist constants $Q > q > 0$ so that

then the interval (4.58) can be somewhat extended. Indeed, from the equality ($4.29_3$) and the inequality (4.59) we obtain the inequality
"^12  -  "»  -  xb'I              e.a[2-«(X-a)I 
p{u.,Qre.)    = — -     > ^ 


11  ,        o,-     -^  2~2      -     ,        „^     -^  2^2 

1  -  2b  ae.  +  c.of    e.  1  -  2b  ore    +  c.or    e 

1111  1111 


Combining  this  with  the  inequality  (4.  29   )  we  derive 


p<u,flte)     >      6'a'[2  -ar(l--3.)]     >      0  (4.29°) 

11  ^  (^  o 


for $\alpha$ lying in the interval


0     <     or     <      2(1  -  ^)'^      .  (4.60) 
Special cases. (j) If $K = C = S = I$ and $T$ is a bounded, selfadjoint, and positive definite operator satisfying condition (4.59), then our results yield in this case the corresponding result of Altman [1].

(jj) If in addition to conditions in (j) we take $\alpha = 2$, which clearly belongs to the interval (4.60), then the procedure ($4.8_3$) reduces to the method of normal chords [19].


(iv) The generalized method of steepest descent.(19) If $t_i = \theta_i$, where $\theta_i$ is determined by (4.23), then in view of Remark 9 the choice $t_i = \theta_i$ has the property of minimizing the functional $\lambda(u_{i+1})$ so that the scheme (4.8) reduces in this case to the generalized method of steepest descent

$$u_{i+1} = u_i - \theta_i h_i, \qquad i = 0, 1, 2, \ldots, \tag{$4.8_4$}$$

which is a well-known procedure in case of finite symmetric and positive definite matrices [15, 3]. Let us observe that according to the graph of $p$ the roots $\theta_i$ can be, in general, both positive and negative. Since $p_i = p(u_i, \theta_i) > 0$ and, by (4.22), $1/p_i = 1/\theta_i - b_i \le |1/\theta_i| + |b_i|$ we see that to verify the validity of (4.29) it is sufficient to show that

$$|1/\theta_i| = \tfrac{1}{2}\Bigl| a_i s_i \pm \sqrt{(a_i s_i)^2 - 4(a_i b_i s_i - c_i)} \Bigr|$$

and $|b_i|$ possess upper bounds which are independent of $i$.

To that end note that by definition of $b_i$ and (4.57) we have $|b_i| \le |h_i|_1/|u_i|_1 \le \tilde M$ while by (4.28), (4.3), (4.36), and Remark 9 we have $|a_i s_i| \le M_1 + \lambda^0/\ell_2$. Furthermore, this implies that

$$|a_i b_i s_i - c_i| \le |b_i|\,|a_i s_i| + c_i \le \bigl(|a_i s_i| + c_i^{1/2}\bigr)c_i^{1/2} \le \tilde M\Bigl(M_1 + \frac{\lambda^0}{\ell_2} + \tilde M\Bigr).$$


(19) This method was investigated for the nonbounded operators by Samokish [33] for the case when $K = S = I$ and $T^{-1}$ and $C^{-1}$ are compact selfadjoint operators.
Thus, the above inequalities imply that $|1/\theta_i|$ and $|b_i|$ have upper bounds independent of $i$. Consequently, there exists a constant $\delta_4 > 0$ so that

$$p(u_i, \theta_i) \ge \delta_4 > 0, \qquad i = 0, 1, 2, \ldots; \tag{$4.29_4$}$$

i.e., condition (4.29) is satisfied. Hence, Lemma 4.2 is valid in this case.

Remark 12. Let us observe that, under the present very general conditions on $C$ and the arbitrariness of $u_0$, the investigation of the graph of $p(u,t)$, and especially of the Cases 3i and 3j, shows not only that the $\theta_i$ can be positive and negative but also that they can be arbitrarily large, since the denominator in the expression for $\theta_i$ can be arbitrarily small. Consequently, without further conditions on either the choice of $C$ or the initial element $u_0$ or on both we cannot expect the condition (4.34) to be satisfied. We shall return to this problem at some later time.

(v) Accelerated method of steepest descent. If $t_i = \alpha\theta_i$, then we obtain the accelerated method of steepest descent

$$u_{i+1} = u_i - \alpha\theta_i h_i, \qquad i = 0, 1, 2, \ldots. \tag{$4.8_5$}$$

Using the fact that $b_i a_i s_i \theta_i^2 - s_i a_i \theta_i - c_i \theta_i^2 = -1$, it is easy to see that

$$p(u_i, \alpha\theta_i) = \frac{\alpha\theta_i (1 - b_i\theta_i)(2 - a_i s_i \alpha\theta_i)}{(1 - b_i\theta_i)(1 - 2b_i\alpha\theta_i + c_i\alpha^2\theta_i^2)} = \frac{\alpha\theta_i}{1 - b_i\theta_i}\,\bigl[1 + (1 - \alpha)\,g(u_i, \theta_i, \alpha)\bigr], \tag{4.61}$$

where

$$g(u_i, \theta_i, \alpha) = \frac{1 - 2b_i\theta_i + \alpha c_i\theta_i^2}{1 - 2b_i\alpha\theta_i + \alpha^2 c_i\theta_i^2}, \qquad i = 0, 1, 2, \ldots. \tag{4.62}$$

Formulas (4.61) and (4.62) suggest that we take $\alpha$ in the range: $0 < \alpha \le 1$. Furthermore, in order for $g$ to be positive, $\alpha$ must also satisfy the condition

$$\frac{2b_i\theta_i - 1}{c_i\theta_i^2} < \alpha \le 1. \tag{4.63}$$

Since $1 > (2b_i\theta_i - 1)/c_i\theta_i^2$ and $(2b_i\theta_i - 1)/c_i\theta_i^2 > 0$ for $\theta_i < 0$ we must conclude from this and (4.63) that we cannot take $\alpha$ so that $0 < \alpha \ll 1$. Thus, we have the justification of the suggestion derived experimentally [37] that we take $\alpha$ so that $0 \ll \alpha < 1$. As a matter of fact, we should take $\alpha$ so that $0 \ll \alpha < 1$ if $1 - 2b_i\theta_i + \alpha c_i\theta_i^2 > 0$ and $\alpha = 1$, otherwise.

If these conditions are satisfied, then from (4.61) and ($4.29_4$) we get

$$p(u_i, \alpha\theta_i) \ge \alpha\delta_4 > 0, \qquad i = 0, 1, 2, \ldots.$$

Special case. An important special case is obtained when $C = S$, i.e., when $b_i = 0$. In this case

$$p(u_i, \alpha\theta_i) = \alpha\theta_i\Bigl[1 + (1 - \alpha)\,\frac{1 + \alpha c_i\theta_i^2}{1 + \alpha^2 c_i\theta_i^2}\Bigr] \ge \alpha\theta_i$$

for $\alpha$ in $0 < \alpha \le 1$. The form of $p$ suggests that in this case we also take $\alpha$ to be near 1.
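
The closed form for $p(u_i, \alpha\theta_i)$ in the case $C = S$ can be compared with a direct evaluation of $p(u,t) = t(2 - st)/(1 + ct^2)$. The following is a minimal numerical check, assuming $a = 1$, $b = 0$ and that $\theta$ is the positive root of $c\theta^2 + s\theta - 1 = 0$, as in Lemma 4.5. The sample values of $s$ and $c$ and the function names are hypothetical.

```python
import math

def gain_direct(s, c, t):
    """p(u, t) = t(2 - s t)/(1 + c t^2), the case a = 1, b = 0."""
    return t * (2.0 - s * t) / (1.0 + c * t * t)

def gain_closed_form(c, theta, alpha):
    """alpha*theta*[1 + (1 - alpha)(1 + alpha c theta^2)/(1 + alpha^2 c theta^2)]."""
    ratio = (1.0 + alpha * c * theta ** 2) / (1.0 + alpha ** 2 * c * theta ** 2)
    return alpha * theta * (1.0 + (1.0 - alpha) * ratio)

# Hypothetical sample values; any real s and any c > 0 would do.
s, c = 0.3, 1.7
# Positive root of c t^2 + s t - 1 = 0, written in a cancellation-free form.
theta = 2.0 / (s + math.sqrt(s * s + 4.0 * c))

# Each entry holds (alpha, direct value, closed-form value).
results = [
    (alpha, gain_direct(s, c, alpha * theta), gain_closed_form(c, theta, alpha))
    for alpha in (0.25, 0.5, 0.9, 1.0)
]
```

For every `alpha` in the list the two evaluations should agree up to rounding, and neither should fall below `alpha * theta`, in line with the inequality stated above.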

(vi) A modified method of steepest descent. We have seen in (iv) that without further conditions on $C$ and $u_0$ the choice $t_i = \theta_i$ will not satisfy the inequality (4.34). However, the following modification of procedure ($4.8_4$)(20), called here a modified method of steepest descent, will be shown to satisfy all our conditions.

Let us determine a sequence $t_i = t(u_i)$ as follows: Choose an arbitrary fixed constant $\delta_6 > 0$ and define $t_i$ by

$$t_i = \begin{cases} \theta_i, & \text{if } 0 < \theta_i \le \delta_6, \\ \delta_6, & \text{if no such root exists.} \end{cases} \tag{4.64}$$

This sequence is simple to construct since it involves only the solution of (4.19)


(20) In case $K = I$ and other operators are finite symmetric and positive matrices the method was proposed by Hestenes and Karush [11].
and a comparison of numbers. It follows from our construction of $t_i$ and the discussion in (iv) that there exists a constant $\delta_5 > 0$ such that

$$\delta_5 \le t_i \le \delta_6, \qquad i = 0, 1, 2, \ldots.$$

Thus, conditions (4.34) and (4.40) are fulfilled. Furthermore, the graphs of $p(u,t)$ and (4.64) show that $p(u,t)$ is an increasing function on the interval $0 < t \le t(u_i)$. Consequently, we can select a number $\bar\delta_5 > 0$ so that $\bar\delta_5 \le \delta_5$ and $\bar\delta_5 \in (0, 2M_1^{-1})$ and for which, in virtue of (4.28) and (4.57), we have

T  (2  -  a.s.I)  "6,(2  -  "s^m'S 

,      -Tv      ^        /       T\  5 1  1   5  .  5 5     1 

p(u.,t.)      >      p(u.,6   )      =      — — —     >      o         ^      °      • 

a  -  2h.6.  +  c.(6n  (1  +  6    M) 

1   D         1      5  5 

Thus,   we  see  that  in  this  case  conditions  (4.  29)  and  (4.  34)  and  even  (4.  40)  are 
satisfied. 
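
The step rule (4.64) can be sketched for the matrix case. The example below assumes $K = C = S = I$ and a symmetric positive definite $T$, in which case the root $\theta_i$ is taken to be the positive root of $d t^2 + s t - 1 = 0$ from Lemma 4.5, so the rule amounts to capping $\theta_i$ at $\delta_6$. The matrix, the value of `delta6`, and the function names are assumptions made for the illustration.

```python
import math

def matvec(A, x):
    """Product of a square matrix (list of rows) with a vector."""
    return [sum(a * xj for a, xj in zip(row, x)) for row in A]

def dot(x, y):
    return sum(a * b for a, b in zip(x, y))

def rayleigh(T, x):
    """lambda(x) = (Tx, x)/(x, x) for K = S = I."""
    return dot(matvec(T, x), x) / dot(x, x)

def modified_step(theta, delta6):
    """Rule (4.64): use theta if 0 < theta <= delta6, otherwise delta6."""
    return theta if 0.0 < theta <= delta6 else delta6

def modified_steepest_descent(T, u0, delta6, steps):
    """Iterate u_{i+1} = u_i - t_i h_i with t_i chosen by modified_step."""
    u = list(u0)
    history = [rayleigh(T, u)]
    step_lengths = []
    for _ in range(steps):
        lam = history[-1]
        h = [tu - lam * ui for tu, ui in zip(matvec(T, u), u)]
        hh = dot(h, h)
        if hh == 0.0:                      # u is already an eigenvector
            break
        s = rayleigh(T, h) - lam           # s = lambda(h) - lambda(u)
        d = hh / dot(u, u)                 # d = (h, h)/(u, u)
        # Positive root of d t^2 + s t - 1 = 0 in a cancellation-free form.
        theta = 2.0 / (s + math.sqrt(s * s + 4.0 * d))
        t = modified_step(theta, delta6)
        step_lengths.append(t)
        u = [ui - t * hi for ui, hi in zip(u, h)]
        history.append(rayleigh(T, u))
    return u, history, step_lengths

# Illustrative data (assumptions): eigenvalues of T are 2 - sqrt(2), 2, 2 + sqrt(2).
T = [[2.0, -1.0, 0.0],
     [-1.0, 2.0, -1.0],
     [0.0, -1.0, 2.0]]
delta6 = 0.5
u_final, lam_history, step_lengths = modified_steepest_descent(T, [1.0, 0.0, 0.0], delta6, 300)
```

Because $p(u,t)$ is increasing on $0 < t \le \theta_i$, a capped step still lowers $\lambda$, only by less than the full steepest-descent step would.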

We shall end this chapter by observing that the form (4.15) of $p(u,t)$ suggests another choice of $t_i$, namely, $t_i = 1/|a_i s_i|$, so that (4.8) reduces in this case to

$$u_{i+1} = u_i - \frac{h_i}{|a_i s_i|}, \qquad i = 0, 1, 2, \ldots. \tag{$4.8_6$}$$

It is easy to see that when $|h_i|_1/|u_i|_1$ is small, as will be the case when $u_i$ is near the solution, then the choice $t_i = 1/|a_i s_i|$ is a good estimate of the optimum

value of $\theta_i$. It can be shown that when we choose the operators $K$ and $C$ appropriately and assume that $u_0$ is so chosen that for some $i$ we have $\lambda^i < \lambda_2$, then the sequence $\{1/|a_i s_i|\}$ satisfies conditions (4.29), (4.34), and (4.40). The discussion of the last statement as well as the investigation of the closely connected problem indicated in Remark 12 will be taken up at some later time.